hero

Work at a Portfolio Company

Analytics Data Engineer- Inference

Together

Together

Data Science
Los Angeles, CA, USA · San Francisco, CA, USA
Posted on Jun 29, 2024
Role

Together AI is seeking an Analytics Data Engineer to lead the development of our research data pipelines. These pipelines are essential for conducting analyses, guiding business decisions, and driving product growth. If you are enthusiastic about data and eager to create impactful solutions, we want to hear from you. This role offers the chance to work closely with our AI researchers, helping build out our incredibly fast inference engine. Come join us in shaping the future at Together AI!

Responsibilities

  • Design, build and manage our data pipelines, ensuring all user event data is seamlessly integrated into our data lake.
  • Develop canonical datasets to track key product metrics including inference engine latency, throughput, user growth, and revenue.
  • Work collaboratively with various teams, including Research, Engineering, Product and Sales to understand their data needs and provide solutions.
  • Implement robust and fault-tolerant systems for data ingestion and processing.
  • Participate in research and engineering decisions, bringing data engineering expertise to the team.
  • Ensure the security, integrity, and compliance of data according to industry and company standards.
  • Develop data infrastructure and datasets for foundation model training and fine-tuning

Requirements

  • 3+ years of experience as a software or data engineer.
  • Proficiency in one or more programming languages used within our Data Engineering ecosystem, such as Python or Rust.
  • Experience with distributed processing technologies and frameworks, such as Spark, Kafka, Daft, Trino
  • Familiar with distributed storage systems such as S3 or R2
  • Solid understanding of Spark and ability to write, debug and optimize Spark code.
  • Knowledge of data formats such as Parquet and Arrow
  • Ability to statistically model and reason about traffic patterns
  • Preferred: Understanding of data quality and data mixtures for ML models

About Together AI

Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society. Together, we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next-generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge.

Equal Opportunity

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunities to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.