Skip to content

Conversation

@mcarmonaa
Copy link
Contributor

Changes:

  • repack command: it only repacks watchers.csv and projects.csv from
    the ghtorrent's mysql dump.

  • discover command: it only generates one file named repositories.csv.gz
    containing all the information needed in the next steps
    (repository name, number of stars).

  • select command: it can only filter by number of stars. It generates
    two files: a list on the stdout of one url repository per line to
    feed borges and a file named repositories-index.csv.gz (filtered from
    repositories.csv.gz) to be used by the index generation command.

  • index command: it will use the repositories.csv.gz by default. To give
    it the filtered information in repositories-index.csv.gz the flag -r
    must be used.

Signed-off-by: Manuel Carmona manu.carmona90@gmail.com

Signed-off-by: Manuel Carmona <manu.carmona90@gmail.com>
Signed-off-by: Manuel Carmona <manu.carmona90@gmail.com>
Changes:

- repack command: it only repacks watchers.csv and projects.csv from
  the ghtorrent's mysql dump.

- discover command: it only generates one file named repositories.csv.gz
  containing all the information needed in the next steps
  (repository name, number of stars).

- select command: it can only filter by number of stars. It generates
  two files: a list on the stdout of one url repository per line to
  feed borges and a file named repositories-index.csv.gz (filtered from
  repositories.csv.gz) to be used by the index generation command.

- index command: it will use the repositories.csv.gz by default to give
  it the filtered information in repositories-index.csv.gz the flag -r
  must be used.

Signed-off-by: Manuel Carmona <manu.carmona90@gmail.com>
@mcarmonaa mcarmonaa requested a review from a team April 10, 2019 12:24
Add GHTORRENT_MYSQL env to the command struct tag and format the
findMostRecentMySQLDump function.

Signed-off-by: Manuel Carmona <manu.carmona90@gmail.com>
@mcarmonaa mcarmonaa requested a review from jfontan April 11, 2019 07:53
@jfontan jfontan merged commit 9949438 into src-d:master Apr 11, 2019
@mcarmonaa mcarmonaa deleted the improvement/generated-files branch April 15, 2019 12:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants