You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Implemented data collection of scraped data. (#481)
* Added listen on all interfaces to bolt configuration for neo4j in the analysis ms
* Allowed more auth failure attempts
* Allowed more auth failure attempts
* Allowed more auth failure attempts
* Added default listen address and advertised address
* Disabling auth so that password can be reset
* Disabling auth so that password can be reset
* Adding auth back in to trigger password reset.
* Testing usage of neo4j
* Testing usage of neo4j
* changed tweet fetch interval to each minute, implemented deletion of cronjobs by default.
* Setting collection pod count to zero to trigger cronjob destruction
* Reinstated 3 collection ms replicas
* Updated lerna-lite version
* Updated lerna-lite version
* Added environment variables to analysis ms template, updated values.yaml files and updated circle config.
* Added environment variables to analysis ms template, updated values.yaml files and updated circle config.
* Added environment variables to analysis ms template, updated values.yaml files and updated circle config.
* Added environment variables to analysis ms template, updated values.yaml files and updated circle config.
* Added quotes around config values
* Updated values to override
* Updated values to override
* Updated values to override
* Removed quotes from config fields
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Testing access to complex value
* Updated apollo-datasource-neo4j version
* Started using neo4j ssc
* Started using neo4j defaults
* Added map reduce to compute sentiment.
* Added map reduce to compute sentiment.
* Updated tweet addition commands to merge with nodes already present. Testing to see if unique trees are formed in db.
* Updated memory and cpu usage for neo4j
* Updated query
* Added more memory to neo4j
* Reverted memory configuration as a result of do limits
* Removed recreate pods.
* bumped version of apollo-datasource-neo4j
* Bumped version of apollo-datasource-neo4j
* Updated querries to include named relationships
* Updated logging statement in analysis service
* Updated moment to use milliseconds, not seconds
* Added memory to resources of analysis neo4j
* Added some additional jvm arguments.
* Updated heap size for analysis neo4j
* Set heap sizes manually for analysis neo4j
* Made heap memory 3gb on analysis neo4j
* Testing something
* Reduced memory size to that in documentation. Upgraded size of volume connected for data storage.
* Changed cron job schedule to every 6 hours
* Updated timeline insertion query
* Modified neo4j queries
* Began addition of expected data model to front-end.
* Continued addition of expected data model to front-end.
* Continued implementing handling of new data model.
* Enabled http through defaults.
* Added TODO to data collector
* Modified front end to use specific lib in moment, modified analysis ms to add sentiment to neo4j
* Modified front end to use specific lib in moment, modified analysis ms to add sentiment to neo4j
* Updated stuff
* Updated subscription resolvers
* Updated analysis service to use sentiment score, not object
* Updated analysis service to use sentiment comparative score, not object
* Updated sentiment check
* Updated tweet check in neo4j data source
* Added logging statement to sentiment addition in analysis service
* Made start data collection async so execution of commands could be awaited. Updated necessary functions in collection service.
* Bumped version of KafkaJS used.
* Testing whether waiting causes correct reception of kafka messages.
* Testing whether waiting causes correct reception of kafka messages.
* trying something
* Added logging statements to inspect issue with addition of sentiment node
* Added deployment strategy to test result.
* Testing use of maxunavailable and max surge for rolling updates.
* Testing use of maxunavailable and max surge for rolling updates.
* Added minReadySeconds to deployment for testing
* More deployment tests
* Moved processing of economic entity memo in collection ms to immediate function
* Modified data collector to see if that's causing the issue
* Modified data collector to see if that's causing the issue
* Updated bitnami
* Eliminated fromBeginning for testing
* testing
* Started using addSentiments instead of addSentiment. Made addSentiment private due to requirement of creating dates, economicEntity first.
* Modified collection ms to only emit most recent tweet recording, modified edge and coll + analysis ms to use utcDateTime string instead of unix timestamp, modified neo4j operations to use supplied datetime(...) object. Standardizing on UTC date time strings.
* Instead of storing string in mongodb using date object.
* Adding recommended new Date(...). Seems strange but testing the result.
* Added logging statement to collection ms for failure to create tweets in datastore.
* Added logging statement to collection ms for failure to create tweets in datastore.
* Modified reduction operation to use utcDateTime in collection ms
* Removed tweet extraction from analysis ms compute sentiment.
* Used tweet text instead of tweet object in analysis service
* Used text in sentiment calculation instead of tweet object in analysis service
* Using 128 Gi instead of 64 after hanging of neo4j UI
* Got rid of 'not' sign in Number.isNAN(...) for neo4j store in analysis ms.
* Added logging statement to read function
* Mapped neo4j results to sentiment object.
* Updated front-end and back-end. testing results.
* Added fetch sentiments in neo4j store, mapped recent and arbitrary sentiment results to that expected by the client
* Switched Number out with Float in analysis ms graphql schema
* Updated storage to be 128 gigs for analysis neo4j
* Added utcDateTime to rows returned from readSentiment query for compatability with reducer, eliminated seamingly unecessary database operations after modification of apollo-datasource-neo4j
* Modified readSentiment query to include utcDateTime instead of dateTime.
* Modified readMostRecentSentiment query in neo4j store of analysis ms
* Changed neo4j store query executed based on neo4j docs
* Added logging statement for trouble shooting
* Made modifications based on query results.
* Made modifications on query.
* Updated neo4j store query in analysis ms
* Updated neo4j store query in analysis ms
* Enabled apoc procedures.
* Modified query
* Modified query
* Modified query
* Modified graphql scalar date to accept strings instead of ints
* Testing analysis ms schema using double array in graphql
* Renamed resolved in analysis m ms to match schema
* Modified query
* Modified query
* Removed null default from neo4j store reduction of database data in analysis ms.
* Ordered sentiment metch ascending, modified front-end sentiment graph and summary to operate with new data. Still have to do subscription.
* Ordered sentiment metch ascending, modified front-end sentiment graph and summary to operate with new data. Still have to do subscription.
* Modified get sentiment query for neo4j store of analysis ms
* Modified get sentiment query for neo4j store of analysis ms
* Made some modifications to front-end, eliminated some TODOs
* Added formatting to start date for start and end date date pickers, fixed issue where null wasn't correctly processed.
* Modified collection cron to occur every minute.
* Got rid of extend in schema def for subscription ms, also added filter based on endDate being null
* Commented check on null date for testing purposes.
* Updated filter function of subscription ms, updated analysis ms to send economicEntity object instead of name and type separately.
* Modified subscription to return payload data and not entire payload.
* Added logging statement to trouble-shoot subscription ms.
* Modified neo4j store query to return standard UTC format, set most recent data in analysis service of analysis ms to return data fromand not array
* Added logging statement to analysis ms for troubleshooting purposes.
* Eliminated overwhelming logging statement in analysis ms
* Added parenthesis around recent sentiment fetch in analysis ms
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Testing neo4j aggregate function collect(...)
* Modified neo4j query to collect tweets and average sentiment causing group by utcDateTime, updated analysis ms
* Updated queries.
* Added filter to sentiment subscription to require a null end-date.
* Changed cronjob tweet fetch interval in collection ms back to every 6 hours.
* Eliminated some sub-todos in summary page code. Moved markup into functions to make render function more readable.
* Added time to front-end values sent back so that start and end date would be included.
* Added limit functionality to fetch sentiments
* Backed out limit changes
* Centralizing types for consistency and ease of maintenance.
* Added factory method for economic entity type, tied type into graphql schema definitions for change broadcasting, updated various components.
* Added factory method for economic entity type, tied type into graphql schema definitions for change broadcasting, updated various components.
* Added factory method for economic entity type, tied type into graphql schema definitions for change broadcasting, updated various components.
* Updated various functions to use economic entity object for shared definitions.
* Testing copy of type directory into docker image.
* Testing copy of type directory into docker image.
* Testing copy of type directory into docker image.
* Added creation of symbolic link to keep docker build context constraint during docker build.
* Added creation of symbolic link to keep docker build context constraint during docker build.
* Added creation of symbolic link to keep docker build context constraint during docker build.
* Attempting to run npm install in project and copy into image.
* Attempting to run npm install in project and copy into image.
* Attempting to run npm install in project and copy into image.
* Updated remaining circle ci config builds, updated remaining dockerfiles.
* Updated remaining circle ci config builds, updated remaining dockerfiles.
* Modified circle config ms build to only install one file, modified dockerfiles to copy the one file to the necessary directory.
* Testing use of relative path in file add
* Updated lock file, reverted circle ms build to use yarn install instead of add.
* Modified economic entity type validity check.
* Updated cert ids
* Changed neo4j block storage allocation for analysis ms to 64Gi
* Added toggling of dependency installs
* Made modifications due to readonly objects
* Updated some of the collection ms to operate on economic entity type.
* Modified configuration ms
* Updated analysis ms end date check to allow for nullability.
* Returned valid case for end date instead of when it's invalid.
* Updated tweet sentiment computed event to be sentiment computed, eliminated dead code, modified analysis ms to use economic entity objects.
* Updated tweet sentiment computed event to be sentiment computed, eliminated dead code, modified analysis ms to use economic entity objects.
* Updated much of the collection ms to use thinkdeep types
* Updated remaining portion of collection ms to use thinkdeep type
* added logging statement to collection ms economic entity memo for testing purposes.
* Added filter to read all to see if error subsides.
* Modified collection ms to return null if no results found in _readMemo
* Added await to retrieve array
* Separated await on toArray from filter method
* Updated validity check in filter function to be correct.
* Updated config ms to use thinkdeep type
* Updated subscription ms to use thinkdeep type
* Modified type to be compatible with front-end as well.
* Created util package to separate utilities such as validity checking, renamed thinkdeep type to model
* Updated necessary dockerfiles to include copy of util package.
* Modified front-end graphql files to be js to incorporate thinkdeep model, added tests to data collector.
* Implemented a couple tests.
* Implemented a couple tests.
* Added some tests around economic analyzer
* Added more tests.
* Added more tests.
* Added more tests
* Updated validEconomicEntity in type
* Fixed broken fe tests.
* Made some changes
* Made some changes
* Updated dependencies.
* Added tests around _computeSentiment of analysis ms
* Stubbed out analysis ms neo4j store tests
* Added tests around analysis ms
* Reintroduced collection ms tests and fixed them.
* Added/modified more tests. Updated model dependencies to include those related to testing.
* Added more tests
* Updated tests to be recursive, fixed broken tests in config and coll ms, fixed some code smells
* Updated tests to be recursive, fixed broken tests in analysis ms
* Modified circleci configs to include caching of node modules download for packages, updated environment variables.
* Updated neo4j helm chart version in analysis ms
* Updated e2e tests
* updated neo4j cert id
* Used correct moment import value for front-end
* Updated test finish timeout for web test runner
* Eliminated some code smells.
* Eliminated some code smells.
* Updated cron jobs to run minute to minute for subscription varification.
* Modified the subscription ms
* Modified subscription ms with logging statements to test failure of subscriptions on fe
* Modified subscription ms to create economic entity from payload, return payload.
* Modified subscription ms to create economic entity from payload, return payload.
* Backed out cron every minute.
* Added default advertised address to prod
* Added economic sector, sector type and factory to the model. Started using TDD for data-collector modifications.
* Made progress with site scraping. Added check for robots.txt to ensure sites allow it.
* Wrote tests around search engine interaction and robots.txt check.
* Made some naming changes
* Finished simple scraping of public allowed web pages. Added web site object to model.
* Switched to scraping branch
* Merging master into branch
* Extended timeout expectation
* Updated to newest neo4j helm chart for analysis ms
* Updated resource constrains
* Added changes to economic analyzer tests
* Updated prod config to include neo4j
0 commit comments