Staff Software Engineer, Big Data Storage
About the role
About Pinterest:
Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product.
Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the culture to do your best work. Creating a career you love? It’s Possible.
At Pinterest, AI isn't just a feature, it's a powerful partner that augments our creativity and amplifies our impact, and we’re looking for candidates who are excited to be a part of that. To get a complete picture of your experience and abilities, we’ll explore your foundational skills and how you collaborate with AI.
Through our interview process, what matters most is that you can always explain your approach, showing us not just what you know, but how you think.
What you’ll do:
- Design, implement, and optimize Pinterest’s exabyte-scale data lake storage platform.
- Lead complex technical projects and initiatives for data lake storage and metadata management, driving them from architecture through execution.
- Collaborate with stakeholders and partner teams across the organization to design storage and metadata layer technologies that unlock big data and ML/AI innovations.
- Build storage capabilities that efficiently support large-scale ML/AI workloads, including high-throughput data access, schema evolution, and large-scale column backfills.
- Shape the long-term technical direction for scalable, reliable, and efficient big data storage systems.
- Engage with and contribute to open source communities such as Apache Iceberg, Spark, and Flink to help address Pinterest’s scaling challenges.
What we’re looking for:
- 8+ years of relevant industry experience designing and building large-scale production distributed systems.
- Strong experience designing and maintaining scalable storage, metadata, or data lake infrastructure.
- Experience building storage capabilities for large-scale ML/AI or analytics workloads, including high-throughput data access, schema evolution, and large-scale column backfills.
- Deep knowledge with building distributed systems, data storage systems, and production infrastructure.
- Experience with big data technologies such as Apache Iceberg, Spark, Flink, Presto/Trino, Hive, or similar systems.
- Proficiency in programming languages like Java, Scala, or Python.
- Proven ability to lead complex technical initiatives and influence architecture across teams.
- Strong collaboration, communication, and problem-solving skills, with a drive for technical excellence and innovation.
- Bachelor’s degree in a relevant field such as Computer Science, or equivalent experience.