We are working on a Kafka-based event collection system that needs to gather
events from across data centers. Let's say all the events are produced
in DC1, while the Kafka brokers and consumers are in DC2. The round trip
between DC1 and DC2 is around 80 ms. We expect around 50 million events
a day, peaking at ~5K events a second, and a data volume of ~100 GB a day,
peaking at ~10 MB a second. What is the best way to do this?
--- Is keeping the producer in DC1 and sending events to DC2 a good idea?
--- Should my ZK quorum lie only in DC2, or should it span both DC1 and DC2?
--- Will this problem be solved easily in version 0.8.0 through broker
replication, by keeping brokers in both DC1 and DC2?
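
To put the load figures above in perspective, here is a rough back-of-the-envelope sketch. It assumes the peak figure is meant per second, that 1 GB is 10^9 bytes, and, for the last number only, that a producer keeps a single synchronous request in flight over the 80 ms link. The function and key names are made up for illustration; none of this is a Kafka API.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def load_estimates(events_per_day, bytes_per_day, peak_events_per_s, rtt_s):
    """Derive average rates and a minimum batch size from the stated figures."""
    avg_events_per_s = events_per_day / SECONDS_PER_DAY
    avg_bytes_per_s = bytes_per_day / SECONDS_PER_DAY
    avg_event_bytes = bytes_per_day / events_per_day
    # Assumption: one synchronous request in flight, so at most 1/RTT requests/s.
    round_trips_per_s = 1.0 / rtt_s
    # Events each request must carry to keep up with the peak rate.
    min_batch_at_peak = peak_events_per_s / round_trips_per_s
    return {
        "avg_events_per_s": avg_events_per_s,
        "avg_bytes_per_s": avg_bytes_per_s,
        "avg_event_bytes": avg_event_bytes,
        "round_trips_per_s": round_trips_per_s,
        "min_batch_at_peak": min_batch_at_peak,
    }


# Figures from the post: 50M events/day, 100 GB/day, ~5K events/s peak, 80 ms RTT.
est = load_estimates(50_000_000, 100 * 10**9, 5_000, 0.080)
for name, value in est.items():
    print(f"{name}: {value:,.1f}")
```

Under these assumptions the average is roughly 580 events/s and 1.16 MB/s, with events of about 2 KB each. A single synchronous connection gets about 12.5 round trips a second, so it would need batches of around 400 events to keep up at peak. That suggests batching or asynchronous sends matter more than raw bandwidth if the producer stays in DC1.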
Thanks & Regards,
Jun Rao 2013-02-04, 05:26
Apoorva Gaurav 2013-02-04, 06:02
Jun Rao 2013-02-04, 16:45
Apoorva Gaurav 2013-02-04, 17:30
Jun Rao 2013-02-05, 00:53
Apoorva Gaurav 2013-02-04, 06:03
S Ahmed 2013-02-04, 14:37
Apoorva Gaurav 2013-02-04, 15:04