Skip to content

Migrate data from an existing Cassandra cluster to Astra using a Spark application.

License

Notifications You must be signed in to change notification settings

DataStax-Examples/astra-spark-migration

 
 

Repository files navigation

Spark Migration Tool for Astra DB

40 minutes, Expert, Start Building

Migrate data from an existing Cassandra cluster to Astra DB using a Spark application.

image

Objectives

  • Leverage Spark to migrate data from a Cassandra cluster to Cassandra on Astra DB.

How this works

We're using Spark to migrate data from a Cassandra cluster to Cassandra on Astra DB.

Get Started

To build and play with this app, follow the build instructions that are located here: https://github.com/DataStax-Examples/astra-spark-migration

Running the Astra DB to Spark Migration Tool

Follow the instructions below to get started.

Prerequisites

Let's do some initial setup by creating a serverless(!) database.

DataStax Astra

  1. Create a DataStax Astra account if you don't already have one: image

  2. On the home page. Locate the button Create Database image

  3. Locate the Get Started button to continue image

  4. Define a database name, keyspace name and select a database region, then click create database. image

  5. Your Astra DB will be ready when the status will change from Pending to Active 💥💥💥 image

  6. After your database is provisioned, we need to generate an Application Token for our App. Go to the Settings tab in the database home screen. image

  7. Select Admin User for the role for this Sample App and then generate the token. Download the CSV so that we can use the credentials we need later. image

  8. After you have your Application Token, head to the database connect screen and select the driver connection that we need. Go ahead and download the Secure Bundle for the driver. image

  9. Make note of where to use the Client Id and Client Secret that is part of the Application Token that we generated earlier. image

Github

  1. Click Use this template at the top of the GitHub Repository: image

  2. Enter a repository name and click 'Create repository from template': image

  3. Clone the repository: image

About

Migrate data from an existing Cassandra cluster to Astra using a Spark application.

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Scala 100.0%