Flexible ingest framework: A scalable architecture for dynamic routing through composable pipelines

Alexei Samoylov,Jason Schlachter

Flexible ingest framework: A scalable architecture for dynamic routing through composable pipelines

2015

Alexei Samoylov
Jason Schlachter

In this paper we describe a flexible and scalable big data ingestion framework based on Apache Spark. It is flexible in that meta-information about the data is used to build custom processing pipelines at run-time. It is scalable in that it leverages Apache Spark with minimal additional overhead. These capabilities allow a user to setup custom big data processing pipelines capable of handling changing data types without the need to recompile code in an operational environment. This is particularly advantageous in secure environments where recompilation is undesirable or unattainable.

Keywords:

Big data
Computer science
Adaptive routing
Pipeline transport
Data processing
Scalability
Data type
Architecture
Spark (mathematics)
Embedded system
scalable architecture

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations