Wikimedia Data Processing

Description

  • This project reads a large volume of real-time stream data from Wikimedia and stores it in a database. A Kafka producer reads the real-time stream data from Wikimedia and writes it to the Kafka broker. A Kafka consumer then consumes that data from the Kafka broker and writes it to the database.
  • The real-time stream data from Wikimedia is available at: https://stream.wikimedia.org/v2/stream/recentchange
  • TBU
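
The data flow described above can be sketched in plain Java. This is only an illustration under stated assumptions, not this project's implementation: an in-memory `BlockingQueue` stands in for the Kafka topic, an `ArrayList` stands in for the MySQL table, and the class name `PipelineSketch`, the methods `produce` and `consume`, and the sample input lines are all hypothetical. It assumes the stream arrives as Server-Sent Events, where each event payload is carried on a line starting with `data:`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Illustrative sketch only. The queue stands in for a Kafka topic and the
// list stands in for a database table; no Kafka or MySQL client is used.
class PipelineSketch {

    // Producer side: keep the payload of each "data:" line and publish it
    // to the topic. Other lines (event:, id:, blank separators) are skipped.
    // Simplification: multi-line "data:" payloads are not joined.
    static int produce(List<String> sseLines, BlockingQueue<String> topic) {
        int published = 0;
        for (String line : sseLines) {
            if (line.startsWith("data:")) {
                topic.add(line.substring("data:".length()).trim());
                published++;
            }
        }
        return published;
    }

    // Consumer side: drain everything currently in the topic and append it
    // to the stand-in database. Returns the number of records stored.
    static int consume(BlockingQueue<String> topic, List<String> database) {
        List<String> batch = new ArrayList<>();
        topic.drainTo(batch);
        database.addAll(batch);
        return batch.size();
    }

    public static void main(String[] args) {
        BlockingQueue<String> topic = new LinkedBlockingQueue<>();
        List<String> database = new ArrayList<>();

        // Made-up sample lines shaped like a Server-Sent Events stream.
        List<String> sample = List.of(
                "event: message",
                "data: {\"title\":\"Example\"}",
                "",
                "event: message",
                "data: {\"title\":\"Another\"}");

        produce(sample, topic);
        consume(topic, database);
        database.forEach(System.out::println);
    }
}
```

In a real setup the queue would be replaced by a Kafka producer and consumer talking to the broker, and the list by a database write. The sketch only shows that the two sides are decoupled: the producer never touches the database and the consumer never touches the stream.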

Technology

1. Backend

  • Java Language | version 11
  • Spring Boot Framework | version 2.5.12
  • Kafka
  • Docker
  • Apache Maven 3.8.4

2. Database

  • MySQL

3. Tools

  • IntelliJ IDEA 2024.1.4 (Community Edition)
  • Docker Desktop 4.31.1

How to set up?

Run Docker:

  • TBU

How to run?

TBU

References

License & Copyright

© 2024 tienhuynh-tn. Licensed under the MIT License.