Python3

Dukascopy Forex Tick Data

Abstract In my exploration of world of big data and I became curious about tick data. Tick data is extremely granular and provides a great challenge for those looking to work on their optimization skills due to its size. Unfortunately, market data is almost always behind a pay wall or de-sampled to the point of uselessness. After discovering the Dukascopy api, I knew I wanted to make this data available for all in a more accessible format.

Pacback

https://github.com/JustinTimperio/pacback Being at the head of Linux kernel and application development means access to the latest features but also often means dealing with the latest bugs. While I don’t run into major bugs often, when they happen, they cripple my productivity. Reversing individual packages is generally a slow manual process and while some tools exist, none meet my needs. In particular, support for downgrading AUR packages is extremely lacking.

Parallel Archiving Techniques

The .tar.gz and .zip archive formats are quite ubiquitous and with good reason. For decades they have served as the backbone of our data archiving and transfer needs. With the advent of multi-core and multi-socket CPU architectures, little unfortunately has been done to leverage the wider number of processors. While archiving then compressing a directory may seem like the intuitive sequence, we will show how compressing files before adding them to a .

Preventing Archives From Freezing Ranger

Ranger is one of the few packages I simply can’t live without. Ranger is one of my most used CLI tools and my defacto standard for navigating directories and transferring files. If you love ranger like me though you have come to deal with its many bugs and excentricities. One of the most frustrating for me personally is rangers automatic previewing of archives. While this feature is great if you are dealing with nothing but tiny archives, but not in a directory full of +100GB archives… At that point ranger becomes unusable and a general nightmare.

GDELT-Diff

https://github.com/JustinTimperio/gdelt-diff This small tool is designed to automate the download, orginization, and storage of GDELT source files. GDELT-Diff includes a deamon that runs every 60 mins fetching any new or missing files and sorts them into folders for easy storage. Additionally, an extremely lightweight tool is provided to maintain a copy of only the streams most recent files in /tmp/gdelt-live. This is for anyone doing real-time analysis of the GDELT and doesn’t require a full copy of the source files.