Samsung is the world’s largest consumer electronics company and the leading provider for smart phones and smart TVs. Samsung smart TVs connect homes to the Internet providing a full range of intelligence capabilities such as speech recognition, gesture recognition, advanced video processing and personalized recommendation.
The Digital Media Solutions Lab at Samsung Research America is building a next-generation data platform to support Smart TV products and services. With California offices in Mountain View and Irvine we are close to a number of tech companies that we collaborate with. Our research and development include TV analytics, ads targeting, and personalized services. We ingest and process billions of records daily from millions of TVs in the field and we are looking for an experienced professional to join our team on the development of an integrated data platform. Our ideal candidate should be adept at enterprise software development using large scale distributed data processing technologies and machine learning.
You will work closely with other data engineers to implement data systems in production-stable environment. Examples include real-time events, batch processing, machine learning (including GPU), data warehouses, data APIs. The successful candidate will be a team player, interested in the success of the entire team as a whole. Job responsibilities include:
Design, develop and maintain distributed, large-scale, real-time event and log data processes in AWS environment
Design and develop scalable data stores with sub-second query latency on highly multi-dimensional data
Follow best practices for high availability, scalability and reliability of data processing components.
Perform capacity planning and cost estimates of proposed solutions
Design telemetry KPIs and monitoring of all new components in preparation to DevOps transition for monitoring
On-board new data sets and expose data insights through APIs
Participate and provide feedback on architecture reviews, development processes, etc.
Lead design and code reviews and coach junior team members.
B.S. in Computer Science or a related field
Recent and relevant experience with scalable distributed data processing, e.g. Spark, MPP or similar in 24x7 production environments
Strong programming skills in Scala/C++/Python/SQL, C# or other languages may substitute
Solid experience in Linux environment, ideally including kernel and network