Trade Sequence

This project demonstrates how to process time-series data with Apache Crunch, using the simple example of sequencing the trades for each stock by time.
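To make "sequencing trades for each stock by time" concrete, here is a minimal single-machine sketch in plain Java. It is not the project's Crunch pipeline and does not use the Crunch API. It only illustrates the intended result: trades grouped by stock, ordered by time within each group, and numbered from 1. The `Trade` class, its field names, and the `TradeSequencer` class are hypothetical and may not match the project's Avro schema.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java illustration only; the project itself uses Apache Crunch.
class TradeSequencer {

    // Hypothetical trade shape; the project's real Avro schema may differ.
    static final class Trade {
        final String symbol;
        final long timestampMillis;
        final double price;
        int sequence; // set by sequence(); 0 means "not yet sequenced"

        Trade(String symbol, long timestampMillis, double price) {
            this.symbol = symbol;
            this.timestampMillis = timestampMillis;
            this.price = price;
        }
    }

    // Groups trades by symbol, sorts each group by time, and numbers
    // the trades in each group starting from 1.
    static Map<String, List<Trade>> sequence(List<Trade> trades) {
        Map<String, List<Trade>> bySymbol = new TreeMap<>();
        for (Trade t : trades) {
            bySymbol.computeIfAbsent(t.symbol, k -> new ArrayList<>()).add(t);
        }
        for (List<Trade> group : bySymbol.values()) {
            // List.sort is stable, so trades with equal timestamps keep input order.
            group.sort(Comparator.comparingLong((Trade t) -> t.timestampMillis));
            int next = 1;
            for (Trade t : group) {
                t.sequence = next++;
            }
        }
        return bySymbol;
    }

    public static void main(String[] args) {
        List<Trade> trades = new ArrayList<>();
        trades.add(new Trade("BBB", 2000L, 20.0));
        trades.add(new Trade("AAA", 3000L, 10.5));
        trades.add(new Trade("AAA", 1000L, 10.0));

        // Print each stock's trades in sequence order.
        for (Map.Entry<String, List<Trade>> entry : sequence(trades).entrySet()) {
            for (Trade t : entry.getValue()) {
                System.out.println(entry.getKey() + " #" + t.sequence
                        + " at " + t.timestampMillis + " price " + t.price);
            }
        }
    }
}
```

On a cluster the same grouping and ordering has to happen across many machines, which is the part a framework such as Crunch handles. The in-memory sort above only works because each group fits in a single JVM.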

Running

The compiled program can be run on a Hadoop cluster with:

hadoop jar target/tradesequence-0.0.1-SNAPSHOT-job.jar /your/hdfs/input/directory /your/hdfs/output/directory

Test data

A small JSON test data file is provided in src/main/avro. On a CDH5 cluster, convert it to an Avro data file by running src/main/avro/create_test_avro.sh. On another Hadoop distribution, edit the script to point to your avro-tools location. The resulting Avro data file can be used as the input for the job.
