PDQ ATS research system
- Users new to R
- Mostly TAQ level II data
- Running in Amazon EC2
- Research focused on transaction costs
PDQ ATS research system
Support diverse exploratory research programs…
…involving all tickers over days to months or so…
…lightweight software footprint, easy to use
You know how this goes
Eventually, some of those arrows connect to R
Moving data around data centers not as big of a deal as you might think
http://www.eecs.berkeley.edu/~ganesha/disk-irrelevant_hotos2011.pdf
Cache/fast storage reuse is usually more important.
Marshaling those data in and out of various formats is probably a bigger deal than you think!
Yuck!
Feather: a common data frame serialization format for R and Python and …
SciDB streaming API
(experimental)
Databases a great at managing mutable data
Append-only data are much simpler to work with
Feathercache: a fast object store interface for R (experimental)
Simple GET/PUT/DELETE-like interface
Elastic computing with R and Redis on Amazon EC2
doRedis update due to CRAN soon, for now use GitHub
Vignettes
Amazon EC2 Recipe