This is a fully remote, mid-level position with TigerGraph, a company specializing in scalable graph databases for enterprise analytics and machine learning. The role involves designing and maintaining distributed systems for graph data, focusing on performance and scalability.
Skills / Requirements
- C++
- Cypher
- Data Ingestion
- Debugging
- Distributed Systems
- Docker
- ETCD
- Fault Tolerance
- Go
- Graph Database
- gRPC
- Kafka
- Performance Tuning
- Query Optimization
- REST API
- Scalability
- System Design
- Vibe Coding
Why Apply
The role is ideal for developers who use AI tools to enhance productivity, as it involves adopting AI-assisted engineering practices. You'll work with technologies like vector databases and similarity search libraries, which benefit from AI-driven optimization.
What You'll Be Doing
You will design, implement, and maintain scalable distributed systems for graph data, focusing on optimizing data ingestion and query pipelines. Your work includes diagnosing complex issues, conducting root cause analysis, and implementing preventive measures.
Pay and Career Growth
The role offers the chance to work with cutting-edge technology in a company trusted by Fortune 500 firms. With a focus on AI and machine learning, it provides opportunities for professional growth in a dynamic field. The hybrid work option offers flexibility for those near company offices.
Benefits and Perks
- AI-assisted engineering
- competitive salary
- Hybrid option
- Remote Work
Is This Role Right for You?
Good fit if you...
- Experienced with vector databases and similarity search libraries.
- Proficient in designing distributed systems and handling scalability challenges.
- Comfortable using AI-assisted coding tools to enhance development processes.
May not be for you if...
- Lacks experience with graph databases or query languages like Cypher.
- Unfamiliar with distributed systems concepts such as sharding and fault tolerance.
- Prefers traditional coding practices over AI-assisted development.
Original Job Description
TigerGraph is a platform for advanced analytics and machine learning on connected data. TigerGraph’s core technology is the only scalable graph database for the enterprise. Its proven technology supports fraud detection, customer 360, MDM, IoT, AI, and machine learning.
Fortune 500 organizations and the most innovative mid-size and startup companies choose TigerGraph to accelerate their analytics, AI, and machine learning:
Seven out of the top ten global banks use TigerGraph for real-time fraud detection.
Over 50 million patients receive care path recommendations to assist them on their wellness journey.
300 million consumers receive personalized offers with recommendation engines powered by TigerGraph.
TigerGraph reduces power outages by optimizing the energy infrastructure for 1 billion people.
This position is primarily remote, but location-based requirements may apply. If the selected candidate is located near one of our company offices, the candidate will have a hybrid work arrangement (2-3 days in-office).
Job Responsibilities
Design, implement, and maintain highly available, scalable, and fault-tolerant distributed systems for graph data.
Tackle performance and scalability challenges, optimizing data ingestion, indexing, and query pipelines for low-latency and high-throughput requirements. Conduct systematic profiling and tuning.
Build, optimize, and operate our core vector embedding infrastructure to enable efficient nearest neighbor search at scale.
Proactively diagnose, debug, and resolve complex issues across the entire data stack, from performance bottlenecks and data inconsistencies to system failures. Lead root cause analysis for production incidents and implement preventive measures.
Requirements
Bachelor’s degree in Computer Science or a related field
5 years of relevant experience
Skills and Knowledge
Deep, hands-on experience with one or more vector databases or similarity search libraries.
Proven experience designing and working with any graph database and query languages like Cypher
Solid understanding of distributed systems concepts: consensus, replication, sharding, and fault tolerance.
Solid programming fundamentals; experienced with C++, Go, or any other major programming language.
Understanding of distributed systems principles and the ability to evaluate trade-offs in system design.
Familiar with Kafka, ETCD or similar technologies;
Proactive and collaborative team player with strong communication skills.
Open to adopting AI-assisted engineering practices (“vibe coding”) to improve productivity and code quality.
Bonus Points
Familiar with container tools such as Docker.
Hands-on experience with gRPC or REST APIs.
Passionate about systems performance profiling, tuning, or debugging.