Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
750 views
in Technique[技术] by (71.8m points)

apache-spark - 如何用消费者的热情衡量卡夫卡话题的滞后?(how to measure the lag of a kafka topic with consumer being a spark?)

We have a spark batch application that uses kafka as output.

(我们有一个使用kafka作为输出的spark batch应用程序。)

Is there a way to monitor Kafka consumer group lag when using spark?

(使用星火时,有没有办法监测卡夫卡消费者群体的滞后?)

We checked the spark documentation and we found that spark always create a random groupId and uses a checkpoint dir to controls the offset.

(我们检查了spark文档,发现spark总是创建一个随机的groupId并使用检查点dir来控制偏移量。)

Is there an approach or an application that you guys recommend that we use to measure the lag of this process?

(你们是否建议我们使用一种方法或应用程序来衡量此过程的时滞?)

We could not find a way to do it.

(我们找不到办法。)

  ask by user3308496 translate from so

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)

What spark version are you using?

(您正在使用什么星火版本?)

AFAIK, you can specify group.id , it's not random.

(AFAIK,您可以指定group.id ,它不是随机的。)

Also, you don't have to use checkpoint dir, you can use kafka itself for managing the offsets (refer to https://spark.apache.org/docs/2.4.0/streaming-kafka-0-10-integration.html#storing-offsets )

(另外,您不必使用检查点目录,可以使用kafka本身来管理偏移量(请参阅https://spark.apache.org/docs/2.4.0/streaming-kafka-0-10-integration。 html#storing-offsets ))

Last but not least, I like to measure the lag from both sides - consumer metrics and broker side.

(最后但并非最不重要的一点是,我喜欢从双方(消费者指标和经纪人方面)来衡量滞后。)

The reason is that I have seen cases in which a consumer group was allegedly assigned to all partitions, but in some cases no offset was reported:

(原因是我看到了一些案例,其中有一个消费者组据称分配给了所有分区,但是在某些情况下没有报告偏移量:)

TOPIC     PARTITION  CURRENT-OFFSET  LOG-END-OFFSET  LAG        CONSUMER-ID
topic1     0          45896605954     46222875284     326269330  consumer-1
topic1     1          45888273257     46227424210     339150953  consumer-1
...
topic1     16         45678505506     46013061139     334555633  consumer-1
topic1     17         -               46225917726     -          consumer-1
topic1     18         45893413333     46225853655     332440322  consumer-1

So the consumer itself didn't report any lag on partition 17, and the only way to catch it was describing the consumer group from broker side and parse it...

(因此,消费者本身没有报告分区17的任何滞后,捕获该分区的唯一方法是从经纪人角度描述消费者组并对其进行解析...)


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...