Commit 2b45d2e

Merge pull request #1 from jeromatron/master
Updated naming and links.
2 parents 12c8d13 + a5544d5 commit 2b45d2e

1 file changed, +12 -12 lines changed

README.md

Lines changed: 12 additions & 12 deletions
@@ -1,27 +1,27 @@
1-
# Loading Data into Apollo with DataStax Bulk Loader
2-
If you're trying to load data into Apollo from a CSV file or from an existing Cassandra table, then you've come to the right place. This example shows how to quickly load data into Apollo using the [DataStax Bulk Loader](https://docs.datastax.com/en/dsbulk/doc/index.html) (DSBulk for short).
1+
# Loading Data into Astra with DataStax Bulk Loader
2+
If you're trying to load data into Astra from a CSV file or from an existing Cassandra table, then you've come to the right place. This example shows how to quickly load data into Astra using the [DataStax Bulk Loader](https://docs.datastax.com/en/dsbulk/doc/index.html) (DSBulk for short).
33

44
Contributor(s): [Dave Bechberger](https://github.com/bechbd) based on the work of [Brian Hess](https://github.com/brianmhess)
55

66
## Objectives
77

8-
* Show how to load data into Apollo from a CSV file on the filesystem or from an existing table in Cassandra
8+
* Show how to load data into Astra from a CSV file on the filesystem or from an existing table in Cassandra
99

1010
## Project Layout
1111

1212
* [data.csv](data.csv) - The CSV data to load
1313
* [schema.cql](schema.cql) - The CQL schema used for this example
1414

1515
## How this Works
16-
Loading data into Apollo using DSBulk is much like loading data into other Cassandra databases with the addition of the requirement to specify the [secure connect bundle](https://docs.datastax.com/en/apollo/aws/doc/dscloud/apollo/dscloudObtainingCredentials.html) as well as the username and password for your Apollo database.
16+
Loading data into Astra using DSBulk is much like loading data into other Cassandra databases with the addition of the requirement to specify the [secure connect bundle](https://docs.datastax.com/en/astra/aws/doc/dscloud/astra/dscloudObtainingCredentials.html) as well as the username and password for your Astra database.
1717

1818
The secure connect bundle is specified using the `-b <INSERT PATH>` parameter on the command line. [See here for more details](https://docs.datastax.com/en/dsbulk/doc/dsbulk/reference/driverOptions.html#driverOptions__driverBasicCloudSecureConnectBundle)
1919

2020
The username is specified using the `-u <INSERT USERNAME>` parameter on the command line. [See here for more details](https://docs.datastax.com/en/dsbulk/doc/dsbulk/reference/driverOptions.html#driverOptions__datastaxJavaDriverAdvancedConnectionAuthProviderUsername)
2121

2222
The password is specified using the `-p <INSERT PASSWORD>` parameter on the command line. [See here for more details](https://docs.datastax.com/en/dsbulk/doc/dsbulk/reference/driverOptions.html#driverOptions__datastaxJavaDriverAdvancedConnectionAuthProviderPassword)
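
Taken together, the three options above can be sketched as a small shell wrapper. This is a minimal sketch rather than the README's own command: the function name `astra_load` and the `DSBULK` variable (an override for the executable path) are hypothetical, and the `-url`, `-k`, and `-t` options are the ones used by the commands in the Running section.

```shell
# Sketch: run a DSBulk load against Astra with the bundle, username,
# and password options described above.
# DSBULK is a hypothetical override for the dsbulk executable path;
# it defaults to ./dsbulk, matching the commands in this README.
astra_load() {
  # $1 = CSV path or URL   $2 = secure connect bundle   $3 = keyspace
  # $4 = table             $5 = username                $6 = password
  if [ "$#" -ne 6 ]; then
    echo "usage: astra_load <csv> <bundle> <keyspace> <table> <user> <password>" >&2
    return 2
  fi
  "${DSBULK:-./dsbulk}" load -url "$1" -b "$2" -k "$3" -t "$4" -u "$5" -p "$6"
}
```

Setting `DSBULK=echo` prints the arguments instead of running DSBulk, which is a cheap way to check quoting before pointing the command at a real database.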
2323

24-
This example only touches the tip of the iceberg of functionality. DSBulk has all the functionality to perform complex loading operations to Apollo as it does to other DDAC and DSE clusters. Check out the docs below for details of the other things it can do:
24+
This example only touches the tip of the iceberg of functionality. DSBulk has all the functionality to perform complex loading operations to Astra as it does to other DDAC and DSE clusters. Check out the docs below for details of the other things it can do:
2525

2626
* [DataStax Bulk Loader Documentation](https://docs.datastax.com/en/dsbulk/doc/)
2727
* [DataStax Bulk Loader: Introduction and Loading](https://academy.datastax.com/content/datastax-bulk-loader-introduction-and-loading)
@@ -34,17 +34,17 @@ This example only touches the tip of the iceberg of functionality. DSBulk has al
3434
### Prerequisites
3535

3636
* DS Bulk v1.4.0 or greater
37-
* An Apollo cluster with the schema ([from schema.cql](schema.cql)) loaded and credential information
38-
**Note** If you need further instruction on how to obtain the secure connect bundle for your Apollo instance then please refer to the documentation located [here](https://docs.datastax.com/en/apollo/aws/doc/dscloud/apollo/dscloudObtainingCredentials.html).
37+
* An Astra cluster with the schema ([from schema.cql](schema.cql)) loaded and credential information
38+
**Note** If you need further instruction on how to obtain the secure connect bundle for your Astra instance then please refer to the documentation located [here](https://docs.datastax.com/en/astra/aws/doc/dscloud/astra/dscloudObtainingCredentials.html).
3939
* A Cassandra cluster (optional if you want to load from Cassandra)
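
The version prerequisite can be checked in a script with a small comparison helper. This is a minimal sketch, assuming plain numeric versions such as `1.4.0`; `version_ge` is a hypothetical helper, and how the installed version string is obtained is left to the caller.

```shell
# Returns 0 (true) when dotted version $1 is greater than or equal to $2.
# Compares up to three numeric fields; missing fields count as 0.
# Assumption: fields are plain integers (no suffixes such as -SNAPSHOT).
version_ge() {
  have=$1
  want=$2
  for _i in 1 2 3; do
    h=${have%%.*}
    w=${want%%.*}
    [ "${h:-0}" -gt "${w:-0}" ] && return 0
    [ "${h:-0}" -lt "${w:-0}" ] && return 1
    # Drop the field just compared; a string with no dots left becomes empty.
    case $have in *.*) have=${have#*.} ;; *) have= ;; esac
    case $want in *.*) want=${want#*.} ;; *) want= ;; esac
  done
  return 0
}
```

A script could then guard the load with something like `version_ge "$installed_version" 1.4.0 || echo "DS Bulk v1.4.0 or greater is required" >&2`, where `installed_version` is a variable the caller fills in.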
4040

4141
### Running
4242

43-
To migrate data into Apollo using DS Bulk you first need to ensure that the target Apollo keyspace has had the schema for the `video_ratings_by_user` table created. This is done via using the DataStax Developer Studio that is embedded in your Apollo instance. For more information on how to use the embedded Studio instance please check the documentation located [here](https://docs.datastax.com/en/apollo/aws/doc/dscloud/apollo/dscloudConnectStudio.html).
43+
To migrate data into Astra using DS Bulk you first need to ensure that the target Astra keyspace has had the schema for the `video_ratings_by_user` table created. This is done via using the DataStax Developer Studio that is embedded in your Astra instance. For more information on how to use the embedded Studio instance please check the documentation located [here](https://docs.datastax.com/en/astra/aws/doc/dscloud/astra/dscloudConnectStudio.html).
4444

4545
#### Loading from CSV
4646

47-
Here is an example command that will load the data.csv file into the `video_ratings_by_user` table in your Apollo instance.
47+
Here is an example command that will load the data.csv file into the `video_ratings_by_user` table in your Astra instance.
4848

4949
**Note** This loads the data from the file stored in the GitHub repo, so the machine running this command needs internet access.
5050

@@ -54,18 +54,18 @@ Here is an example command that will load the data.csv file into the `video_rati
5454

5555
#### Loading from an existing Cassandra table
5656

57-
To load data from an existing table in a Cassandra keyspace into Apollo there are two options to accomplish this.
57+
To load data from an existing table in a Cassandra keyspace into Astra there are two options to accomplish this.
5858

5959
##### Option 1 - Unload and Load in Separate Steps
60-
The first option for loading data from an existing Cassandra cluster into Apollo requires that you unload the data from the Cassandra cluster into a local file and then load the data into Apollo. The commands to accomplish this look like this:
60+
The first option for loading data from an existing Cassandra cluster into Astra requires that you unload the data from the Cassandra cluster into a local file and then load the data into Astra. The commands to accomplish this look like this:
6161

6262
```
6363
./dsbulk unload -h <CASSANDRA CLUSTER IP> -k <KEYSPACE NAME> -t video_ratings_by_user -url /path/to/file/migrate.csv
6464
./dsbulk load -url /path/to/file/migrate.csv -b /path/to/bundle.zip -k <KEYSPACE NAME> -t video_ratings_by_user -u <USERNAME> -p <PASSWORD>
6565
```
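
When the two steps are scripted, it helps to skip the load if the unload fails. The following is a minimal sketch under stated assumptions: `migrate_table` and the `DSBULK` override variable are hypothetical names, the keyspace and table are assumed to be named the same on both sides (as in the commands above), and the DSBulk options are the ones already shown.

```shell
# Sketch: unload a table from Cassandra to a local path, then load it
# into Astra, stopping early if the unload step reports failure.
# DSBULK is a hypothetical override for the executable (default ./dsbulk).
migrate_table() {
  # $1 = Cassandra host   $2 = keyspace   $3 = table   $4 = export path
  # $5 = secure connect bundle   $6 = username   $7 = password
  dsbulk_bin=${DSBULK:-./dsbulk}
  "$dsbulk_bin" unload -h "$1" -k "$2" -t "$3" -url "$4" || {
    echo "unload failed; skipping load" >&2
    return 1
  }
  "$dsbulk_bin" load -url "$4" -b "$5" -k "$2" -t "$3" -u "$6" -p "$7"
}
```

Running it with `DSBULK=echo` prints both argument lists without touching either cluster.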
6666

6767
##### Option 2 - Unload and Load by Chaining Steps
68-
The second option for loading data from an existing Cassandra cluster into Apollo requires that you unload the data from the Cassandra cluster and pipe that into a command load the data into Apollo. This has some advantages as it will run in a single command but it will only run single threaded as it uses stdin/stdout. The commands to accomplish this look like this:
68+
The second option for loading data from an existing Cassandra cluster into Astra unloads the data from the Cassandra cluster and pipes it into a command that loads the data into Astra. This has the advantage of running as a single command, but it will only run single-threaded because it uses stdin/stdout. The command to accomplish this looks like this:
6969

7070
```
7171
./dsbulk unload -h <CASSANDRA CLUSTER IP> -k <KEYSPACE NAME> -t video_ratings_by_user | ./dsbulk load -b /path/to/bundle.zip -k <KEYSPACE NAME> -t video_ratings_by_user -u <USERNAME> -p <PASSWORD>
