Skip to content

Latest commit

 

History

History
61 lines (42 loc) · 1.81 KB

README.md

File metadata and controls

61 lines (42 loc) · 1.81 KB

NiFi RSS Processor

Overview

This project is a custom NiFi processor that extracts fields from a Flow File with content that is valid RSS XML. The processing is done by the ROME Java framework for RSS and Atom feeds.

The processor creates a JSON object with the following fields - with the field name in parenthesis:

  • Publication date (publicationDate)
  • Author (author)
  • Item title (title)
  • Item description (description)
  • Item url (url)
  • Title of feed (feedTitle)
  • Url of feed (feedUrl)

Processor Work Flow

The expected input to this Processor is a NiFi flow file where the content is an XML document that can be processed by the ROME RSS framework.

The output from the processor is one (or more) flow files:

  • The original flow file will pass to either the SUCCESS or FAILURE relationship depending on whether ROME was able to parse the XML input.
  • A new flow file will be created for each Item detected in the RSS XML content. The format of the content of the new flow files is described above.

Using the Processor

This project was generated from the NiFi maven archetype and the build process follows the approach for a project created from the archetype.

Building NiFi NAR

First step is to clone the repository.

git clone https://github.com/gm-ram/nifirssbundle.git

In the root directory of the project run:

mvn clean install

If the build is successful there will be a nar file in the folder nifi-rss-nar/target.

Adding to NiFi

The nar file created after the build process needs to be copied into the lib folder of the NiFi installation.

The final stage is to restart NiFi so that it picks up the new Processor.

License

  • Apache NiFi - Apache License, Version 2.0
  • ROME - Apache License, Version 2.0