[NH-011] - Improve consume feed process #11

@danielmasegosa

Description

We must improve the process of collecting feeds from the OPML file.

Expected Behavior
Several RSS resources have been deprecated, and others cannot be parsed correctly by the Rome library. The affected resources are:

  • AdRoll
  • Arkency
  • Artsy
  • Avenue Code
  • Azavea
  • Blogfoster
  • BloomReach
  • Boxever
  • Cerner
  • Coinbase
  • Convox
  • Data Artisans
  • DeferPanic
  • Deliveroo
  • DoorDash
  • Expedia
  • Flickr
  • Flipboard
  • Foursquare
  • Funding Circle
  • Future Processing
  • Geoblink
  • GitHub Old
  • Google Online Security
  • Grab
  • Grouper
  • Harry's
  • Heroku
  • Honeybadger
  • Hootsuite
  • IFTTT
  • IMVU
  • Intel
  • Intent HQ
  • Intercom
  • InVision
  • Just Eat
  • King
  • Laterooms
  • LiveChat
  • Made Tech
  • MailChimp
  • MemSQL
  • Microsoft Python Engineering
  • Mozilla Automation Team
  • Mozilla Release Engineering
  • Nvidia
  • Opensooq
  • Paypal
  • REA Group
  • Realm.io
  • Remind
  • RisingStack
  • Scrunch
  • Segment
  • Semantics3
  • Semaphore CI Engineering
  • Sensible
  • Sharethis
  • Shazam
  • Shopify
  • Snyk
  • Takipi Target
  • theScore
  • Tinder
  • TripAdvisor
  • TrueCar
  • Tumblr
  • Twitter
  • Venmo
  • Vine
  • VNGRS
  • Wattpad
  • We Are Wizards
  • Wingify
  • WyeWorks
  • Xmartlabs
  • Yahoo
  • Zenefits
  • Addy Osmani
  • Anders Aarvik
  • Andreas Schuster
  • Andrew Bancroft
  • Andrew Ray
  • Andrey Akinshin
  • Barry Warsaw
  • Ben McCormick
  • Bill the Lizard
  • Blundell's Android Tutorials
  • Brandon Rhodes
  • Chris Wellons
  • Code with style!
  • Dave Atchley
  • Discover Meteor
  • Evan Jones
  • Ievgen Kuzminov
  • Ilija Eftimov
  • Ilya Grigorik
  • Jake Wharton
  • James Hague
  • James Long
  • Jan Lelis
  • Jesus Castello
  • Jon Kensy
  • Jonathan Dekhtiar
  • Juan Treminio
  • Larry Land
  • Lerner Consulting Blog
  • Luciano Mammino
  • Manu Sporny
  • Matt Aimonetti
  • Matt Warren
  • Michaël Gallego
  • Michael Herman
  • Mike Ash
  • Peteris Krumins
  • Petr Mitrichev
  • Philipp Oppermann
  • Radek Pazdera
  • Regular Geek
  • Sam Saffron
  • Sergey Zhuk
  • Scott Johnson
  • Stack Abuse
  • Stefan Parker
  • The Coded Self
  • Thomas Young
  • Todd Motto
  • Yifan Lu
  • Android
  • Atom.io
  • React
  • RocksDB
  • Vertabelo

Current Behavior
Multiple errors occur during the collection process. The errors caught in the RssFeedListener service are written to a log trace.

Steps to reproduce
To reproduce the current behavior:

  • Start the docker-compose stack
  • Run FeedCollectorApplication

Steps to fix
A good way to fix these errors could be:

  • In a unit test similar to RssFeedListenerTest, reproduce the parsing of a single feed with the Rome library.
  • If the error can be fixed, fix it and push the change. Otherwise, delete the feed from the OPML file in the collector artifact.
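
The second step (deleting an unfixable feed from the OPML file) could be scripted instead of done by hand. The following is only a minimal sketch under assumptions: it assumes the OPML file uses the usual `outline` elements with `title`/`text` and `xmlUrl` attributes, and `OpmlPruner`, its methods, the sample document, and the example.org URLs are hypothetical names, not part of the collector artifact. It uses the JDK's DOM parser rather than Rome, so it does not reproduce the parsing errors themselves.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: removes feeds known to be broken from an OPML document.
final class OpmlPruner {

    // Made-up sample; the real OPML file in the collector artifact may be laid out differently.
    static final String SAMPLE =
        "<opml version=\"1.0\"><body>"
      + "<outline text=\"Companies\">"
      + "<outline type=\"rss\" text=\"Vine\" title=\"Vine\" xmlUrl=\"https://example.org/vine.xml\"/>"
      + "<outline type=\"rss\" text=\"Example Blog\" title=\"Example Blog\" xmlUrl=\"https://example.org/feed.xml\"/>"
      + "</outline></body></opml>";

    // Parses OPML text into a DOM document; checked exceptions are wrapped for brevity.
    static Document parse(String opml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            // Refuse DOCTYPE declarations; OPML files do not need them.
            factory.setFeature("http://apache.org/xml/features/disallow-doctype-decl", true);
            return factory.newDocumentBuilder().parse(new InputSource(new StringReader(opml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Invalid OPML", e);
        }
    }

    // Removes every feed outline (one that has xmlUrl) whose title is in brokenTitles.
    // Returns the titles that were actually removed, in document order.
    static List<String> prune(Document doc, Set<String> brokenTitles) {
        NodeList outlines = doc.getElementsByTagName("outline");
        // The NodeList is live, so collect the matches first and remove them afterwards.
        List<Element> toRemove = new ArrayList<>();
        for (int i = 0; i < outlines.getLength(); i++) {
            Element outline = (Element) outlines.item(i);
            if (outline.hasAttribute("xmlUrl") && brokenTitles.contains(titleOf(outline))) {
                toRemove.add(outline);
            }
        }
        List<String> removed = new ArrayList<>();
        for (Element outline : toRemove) {
            removed.add(titleOf(outline));
            outline.getParentNode().removeChild(outline);
        }
        return removed;
    }

    // Lists the titles of all feed outlines still present in the document.
    static List<String> feedTitles(Document doc) {
        List<String> titles = new ArrayList<>();
        NodeList outlines = doc.getElementsByTagName("outline");
        for (int i = 0; i < outlines.getLength(); i++) {
            Element outline = (Element) outlines.item(i);
            if (outline.hasAttribute("xmlUrl")) {
                titles.add(titleOf(outline));
            }
        }
        return titles;
    }

    // Prefers the title attribute and falls back to text.
    static String titleOf(Element outline) {
        String title = outline.getAttribute("title");
        return title.isEmpty() ? outline.getAttribute("text") : title;
    }

    public static void main(String[] args) {
        Document doc = parse(SAMPLE);
        System.out.println("Removed: " + prune(doc, Set.of("Vine")));
        System.out.println("Remaining: " + feedTitles(doc));
    }
}
```

Matching by title keeps the sketch close to the list in this issue; matching on `xmlUrl` would be more robust if two feeds ever share a title.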
