Job Description
Location: New York Allium makes blockchain data accurate, simple and fast Blockchain data is hard, messy, and chaotic When we started out in late 2021 our thesis was simple – blockchain data, despite it being public and free, was difficult to understand, clunky to access and troublesome to maintain. Answering a simple question like “Who are the biggest Ethereum token holders over time?” requires an engineering team to run their own RPC nodes, ingest the full history of the blockchain, clean the data, transform the data and finally summon a wizard to cast a complex SQL query.
Accessing data is hard because blockchains are optimized for Writes and not Reads Blockchains have historically been optimized for Writes (getting data onto the blockchain) and less for Reads (getting data out of the blockchain). This focus on transaction throughput and fault‑tolerant consensus has made it hard to get data out efficiently and reliably at scale. Parsing and interpreting blockchain data requires both deep domain expertise and data manipulation Blockchains are virtual computers, not databases.
They support general computations, and anyone can write and deploy their own smart contract for their own use case. The resulting fragmentation of data schemas requires deep domain expertise to turn esoteric outputs into clear information for concepts like tokens, NFTs, stable coins and DEXs. Allium abstracts the complexity with a simple way to query blockchain data Allium tames the chaos by ingesting, sanitizing, and standardizing all this data. As of this post, the data we’ve archived across 40+ blockchains is in the petabytes and growing exponentially.
bolthires and Bloomberg had to organize the world’s public financial and webpage data – Allium is on a mission to do the same for blockchain data We index a giant public dataset that is sorely needed by everyone – similar to what Bloomberg did for financial data and what bolthires organized for public webpages. With this indexed data we support trailblazers in industry trends such as NFTs, stable coins and decentralized exchanges. About our customers We serve two groups of customers with the same data but different platforms: Analysts who need to answer data questions (BI focus) and Engineers who need highly reliable, near‑real‑time queryable data (application backends).
Our customers include Visa, Stripe, Grayscale, Phantom, Uniswap, and other major institutions and crypto companies. About the Role We love engineers who solve new problems every single day. Responsibilities • Data egress – How to transport hundreds of terabytes of data worldwide without breaking the piggybank. • Handle high traffic – Support the biggest applications and handle 100,000 QPS at peak traffic without downtime. • Botnets – Detect botnets based on behavioral patterns in the early days of the industry.
• Fraud (Sybil) detection – Transfer fraud detection heuristics into the blockchain world. • Who is real? – Define meaningful and organic transactions on the blockchain. • Bring
Your Own Transformation – Let customers design their own APIs and transform their own real‑time data streams. • Data governance – Ensure data consistency across every copy and every region 24/7. • AI and LLMs – Design the LLM and AI experience on top of our data to lower the barrier of entry to crypto data. • Data transformation holy grail – Unify streaming and batch transformation logic into a single code base.
If any of those bore you, we have many more problems to solve. Allium sizzle reel Giant infrastructure budget per head You will make mistakes – costly mistakes – but at Allium’s expense. We have an internal leader board of the costliest infrastructure mistakes made, and we learn from them. We provide a huge infrastructure budget to help you refine your craft. We leverage every tool (no prerequisites) because we meet our enterprise customers where they are at: • Every OLAP: Snowflake, Databricks, Bigquery, Clickhouse* • Every OLTP: Postgres, Aurora • Every event bus: Kafka, SNS, Pub Sub • Every cloud provider: AWS, GCP, Azure (one day) • A copy of data in every region: US East, Central, West, Europe, Asia • Every data transformation and orchestration tool: Apache Beam, Materialize, Tiny Bird, DBT, SQLMesh, Temporal • Data governance tools: Data Fold We invite people of all backgrounds.
Engineers who started coding late, who learned on the side, who are still in school, who went to top schools, are all welcome if you bring a curious mind and an infectious work ethic. Administrative Benefits Medical, Dental, Vision, Life and AD&D insurance – US folks get 100% coverage for Gold plans, 80% for dependents. Note:
The sun never sets on Allium – we hire from any geographic location as long as you can overlap two hours of NYC morning overlap Mon‑Thurs from 10am‑12pm ET. We have people based in New York, Seattle, Singapore, and Australia.
#J-18808-Ljbffr Apply tot his job Apply tot his job