Skip to content
View brijrajk's full-sized avatar

Block or report brijrajk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
brijrajk/README.md

       

Typing SVG


👋 About Me

  • 🏢 Senior MTS @ Zettabolt Technologies Pvt. Ltd.
  • 🎓 M.Tech Computer Science — NIT Nagpur
  • 🌍 Based in India
  • 🔥 Working on Apache Spark, Meta Velox & Apache Gluten for native query acceleration
  • 🤖 Building PyTorch OOT device backends for LLM acceleration on custom hardware
  • ⚡ Contributing to vLLM — high-throughput LLM inference engine
  • 🔌 FPGA / Xilinx — hardware-accelerated data pipelines
  • 🧠 5x speedup on Spark via Neuroblade SPU integration (now @ AWS)
  • 🛠️ Built ZettaProf — Spark profiling tool that cuts debug time by 60%
  • 🌱 Active OSS contributor — Apache Spark, Gluten, Velox, PyTorch, vLLM
  • 💬 Ask me about Big Data, C++, LLM infra, distributed systems
  • ⚡ Fun fact: I push query engines into silicon 🔌



🛠️ Tech Arsenal

PyTorch   C++   Python   Java   Scala   Kafka   AWS   Docker   Kubernetes   Linux   Git   CMake

Specialized domains: Apache Spark · Apache Gluten · Velox · PyTorch OOT Backends · vLLM · LLM Acceleration · FPGA (Xilinx/AMD) · Data Analytics · Data Engineering · Parquet/Arrow · Column-store engines · Query optimization


📊 GitHub Stats

  


🧩 LeetCode Stats

LeetCode Profile


📈 Contribution Activity


📊 Detailed GitHub Metrics

GitHub Metrics


🌱 Currently Exploring

▶  PyTorch OOT Device     — custom accelerator backend, device guard, aten ops, dispatcher
▶  LLM Acceleration       — inference optimization, kernel fusion, operator dispatch on custom HW
▶  AI Agent Benchmarking  — evaluating benchmark validity & designing rigorous evaluations
▶  Velox internals        — expression evaluation, vectorization, runtime optimization
▶  FPGA-accelerated SQL   — pushing query engines into hardware

🌍 Open Source Contributions

Active contributor to major Big Data and AI projects. PRs span bug fixes, feature additions, test infrastructure, and documentation across the Spark/Velox/Gluten/PyTorch/vLLM ecosystem.

✅ Merged · 🔄 Open · ❌ Closed without merge

🤖 PyTorch

PR Description Status
#185694 [library] Improve infer_schema error message when future annotations cause NameError ✅ Merged
#185756 [clamp] Fix float16 scalar overflow check inconsistency between CPU and GPU 🔄 Open
#185751 [nn] Raise ValueError early for invalid (ndim, pad_size) in non-constant F.pad modes 🔄 Open

⚡ vLLM

PR Description Status
#44349 [Tests] Gate Step3VL under Transformers v5 🔄 Open

🚀 Apache Gluten

PR Description Status
#12199 [MINOR][VL] Re-enable stale ignored atan2 test in MathFunctionsValidateSuite 🔄 Open
#12158 [GLUTEN-12157][VL] Fix silently-skipped math/scalar test suites; add Velox native tests for sin, tan, tanh, radians, ln 🔄 Open
#12151 [GLUTEN-12013][VL] Fix bloom-filter bytes corruption on whole-stage AQE fallback 🔄 Open

🧠 Velox

PR Description Status
#17677 test(parquet): Verify WriterOptions::encoding is forwarded to Arrow writer ✅ Merged
#17676 docs: Fix duplicate object description warnings in Sphinx doc build ✅ Merged
#17675 docs(geospatial): Expand convex_hull_agg and geometry_union_agg docs ✅ Merged
#17669 feat: Register Spark transform_values function ✅ Merged
#17668 perf(tpcds): Eliminate redundant map allocations in toTableName and fromTableName ✅ Merged

🔥 Apache Spark

PR Description Status
#56154 [SPARK-49798][DOCS] Fix inaccurate documentation of RuntimeConfig.get ✅ Merged
#56250 [SPARK-56561][PYTHON][DOCS] Document order preservation for array_distinct, array_intersect, array_union, array_except 🔄 Open
#56248 [SPARK-34679][DOCS] Add inferTimestamp option to JSON data source options table 🔄 Open
#56178 [SPARK-40437][SS][PYTHON] Support string representation of durationMs in GroupState.setTimeoutDuration 🔄 Open
#56174 [SPARK-43847][PYTHON] Throw structured error when reading Protobuf descriptor file fails 🔄 Open

📦 aws-samples/aws-etl-orchestrator

PR Description Status
#9 Migrate to Python3.12 🔄 Open

📦 duckdb/duckdb

PR Description Status
#23104 Fix *COLUMNS() false rejection when operators appear in lambda bodies ❌ Closed

📦 google/it-cert-automation-practice

PR Description Status
#2336 Closes: #1 🔄 Open

📌 Featured Work

Most of my production work lives in private repositories at Zettabolt Technologies Private Limited. The best window into my hands-on contributions is the open source PRs above — real code, real reviews, real projects.

My pinned repos include forks of the projects I actively work in:


💬 Dev Quote


🐍 Contribution Snake

github-snake


🤝 Connect With Me

 

Open to conversations on Big Data infrastructure, accelerated computing & LLM systems.


"From the query engine to the transformer kernel — it's all just bytes waiting to go faster."

Pinned Loading

  1. spark spark Public

    Forked from apache/spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Scala

  2. incubator-gluten incubator-gluten Public

    Forked from apache/gluten

    Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

    Scala

  3. facebook-velox facebook-velox Public

    Forked from facebookincubator/velox

    A composable and fully extensible C++ execution engine library for data management systems.

    C++

  4. pytorch pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python

  5. aws-etl-orchestrator aws-etl-orchestrator Public

    Forked from aws-samples/aws-etl-orchestrator

    A serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.

    Python