BRIJ RAJ KISHORE brijrajk

👋 About Me

🏢 Senior MTS @ Zettabolt Technologies Pvt. Ltd.
🎓 M.Tech Computer Science — NIT Nagpur
🌍 Based in India
🔥 Working on Apache Spark, Meta Velox & Apache Gluten for native query acceleration
🤖 Building PyTorch OOT device backends for LLM acceleration on custom hardware
⚡ Contributing to vLLM — high-throughput LLM inference engine
🔌 FPGA / Xilinx — hardware-accelerated data pipelines
🧠 5x speedup on Spark via Neuroblade SPU integration (now @ AWS)
🛠️ Built ZettaProf — Spark profiling tool that cuts debug time by 60%
🌱 Active OSS contributor — Apache Spark, Gluten, Velox, PyTorch, vLLM
💬 Ask me about Big Data, C++, LLM infra, distributed systems
⚡ Fun fact: I push query engines into silicon 🔌

🛠️ Tech Arsenal

Specialized domains: Apache Spark · Apache Gluten · Velox · PyTorch OOT Backends · vLLM · LLM Acceleration · FPGA (Xilinx/AMD) · Data Analytics · Data Engineering · Parquet/Arrow · Column-store engines · Query optimization

📊 GitHub Stats

🧩 LeetCode Stats

📈 Contribution Activity

📊 Detailed GitHub Metrics

🌱 Currently Exploring

▶  PyTorch OOT Device     — custom accelerator backend, device guard, aten ops, dispatcher
▶  LLM Acceleration       — inference optimization, kernel fusion, operator dispatch on custom HW
▶  AI Agent Benchmarking  — evaluating benchmark validity & designing rigorous evaluations
▶  Velox internals        — expression evaluation, vectorization, runtime optimization
▶  FPGA-accelerated SQL   — pushing query engines into hardware

🌍 Open Source Contributions

Active contributor to major Big Data and AI projects. PRs span bug fixes, feature additions, test infrastructure, and documentation across the Spark/Velox/Gluten/PyTorch/vLLM ecosystem.

✅ Merged · 🔄 Open · ❌ Closed without merge

🤖 PyTorch

PR	Description	Status
#185694	[library] Improve infer_schema error message when future annotations cause NameError	✅ Merged
#185756	[clamp] Fix float16 scalar overflow check inconsistency between CPU and GPU	🔄 Open
#185751	[nn] Raise ValueError early for invalid (ndim, pad_size) in non-constant F.pad modes	🔄 Open

⚡ vLLM

PR	Description	Status
#44349	[Tests] Gate Step3VL under Transformers v5	🔄 Open

🚀 Apache Gluten

PR	Description	Status
#12199	[MINOR][VL] Re-enable stale ignored atan2 test in MathFunctionsValidateSuite	🔄 Open
#12158	[GLUTEN-12157][VL] Fix silently-skipped math/scalar test suites; add Velox native tests for sin, tan, tanh, radians, ln	🔄 Open
#12151	[GLUTEN-12013][VL] Fix bloom-filter bytes corruption on whole-stage AQE fallback	🔄 Open

🧠 Velox

PR	Description	Status
#17677	test(parquet): Verify WriterOptions::encoding is forwarded to Arrow writer	✅ Merged
#17676	docs: Fix duplicate object description warnings in Sphinx doc build	✅ Merged
#17675	docs(geospatial): Expand convex_hull_agg and geometry_union_agg docs	✅ Merged
#17669	feat: Register Spark transform_values function	✅ Merged
#17668	perf(tpcds): Eliminate redundant map allocations in toTableName and fromTableName	✅ Merged

🔥 Apache Spark

PR	Description	Status
#56154	[SPARK-49798][DOCS] Fix inaccurate documentation of RuntimeConfig.get	✅ Merged
#56250	[SPARK-56561][PYTHON][DOCS] Document order preservation for array_distinct, array_intersect, array_union, array_except	🔄 Open
#56248	[SPARK-34679][DOCS] Add inferTimestamp option to JSON data source options table	🔄 Open
#56178	[SPARK-40437][SS][PYTHON] Support string representation of durationMs in GroupState.setTimeoutDuration	🔄 Open
#56174	[SPARK-43847][PYTHON] Throw structured error when reading Protobuf descriptor file fails	🔄 Open

📦 aws-samples/aws-etl-orchestrator

PR	Description	Status
#9	Migrate to Python3.12	🔄 Open

📦 duckdb/duckdb

PR	Description	Status
#23104	Fix *COLUMNS() false rejection when operators appear in lambda bodies	❌ Closed

📦 google/it-cert-automation-practice

PR	Description	Status
#2336	Closes: #1	🔄 Open

📌 Featured Work

Most of my production work lives in private repositories at Zettabolt Technologies Private Limited. The best window into my hands-on contributions is the open source PRs above — real code, real reviews, real projects.

My pinned repos include forks of the projects I actively work in:

🚀 apache/gluten — Spark native execution offloading to Velox
🧠 facebookincubator/velox — C++ vectorized execution engine
🔥 apache/spark — Distributed analytics engine
🤖 pytorch/pytorch — OOT device backend work

💬 Dev Quote

🐍 Contribution Snake

github-snake

🤝 Connect With Me

Open to conversations on Big Data infrastructure, accelerated computing & LLM systems.

"From the query engine to the transformer kernel — it's all just bytes waiting to go faster."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly