Projects

Table of Contents

A mix of open source projects and key professional contributions across distributed systems, data infrastructure, and developer tooling.

Open Source
#

nickboy/dotfiles

Dotfiles

Shell

nickboy/nickboy.github.io

Personal website for Nick

TypeScript

Professional Highlights
#

FB Feed Ranking Infrastructure
#

C++

ML Infrastructure

Ranking Systems

AWS Billing Unbilled Usage Auditor
#

Java

DynamoDB

AWS Lambda

Distributed Systems

Designed and built a system that detects unbilled usage across AWS services. Reduced false positives by 230 million monthly transactions and cut charge discrepancies by 300x — from $125,000 to $432.

View Project Details Read the Story

Twitter Fleet Kernel Automation
#

Python

Redis

Linux Kernel

Standardized hardware and software configurations for 5,000+ production hosts. Built a Python library for Canary Kernel Validation and designed automation for kernel updates at fleet scale. Resolved 140+ tickets in a single on-call week.

View Project Details Read the Story

Walmart Data Lake Integration
#

Apache Spark

Kafka

Apache Hudi

Big Data

Led enterprise-wide Data Lake integration for recruiting data pipelines, processing data from 10+ sources using Apache Spark, Kafka, and Apache Hudi. Architected scalable solutions for real-time data processing at enterprise scale.

Technical Stack
#

C++

Python

Java

TypeScript

Rust

Apache Spark

Kafka

Kubernetes

Docker

AWS

Terraform

Redis

DynamoDB

View Full Experience

Designed and built a distributed system that detects unbilled usage across all AWS services — reducing charge discrepancies by 300x and eliminating 230 million monthly false positives. Key Metrics # 300x Reduction in Discrepancies $125,000 → $432 230M False Positives Eliminated ~95% Alert Actionability Architecture # flowchart LR A["Usage Records\n(Billions/day)"] --> B["Smart Sampling\n& Aggregation"] B --> C["Multi-Signal\nValidation"] C --> D["Automated\nResolution"] D --> E{Real issue?} E -- Yes --> F["Alert with\nDiagnosis"] E -- No --> G["Auto-resolve\n& Log"] style B fill:#6366f1,color:#fff style C fill:#6366f1,color:#fff style D fill:#6366f1,color:#fff Technical Deep Dive # Aggregation over Brute-Force # Instead of checking every individual usage record (which generated 230M false positives), the system aggregates at the service-account-period level.

Open Source #

Professional Highlights #

FB Feed Ranking Infrastructure #

AWS Billing Unbilled Usage Auditor #

Twitter Fleet Kernel Automation #

Walmart Data Lake Integration #

Technical Stack #