Site Reliability Engineer – CapCut

Company: ByteDance
Location: San Jose, CA, USA
Salary: Not provided
Type: Full-Time
Degrees: Bachelor’s
Experience Level: Mid Level, Senior

Requirements

  • Bachelor’s or higher degree in Computer Science or a related technical discipline.
  • 2-5 years of work experience in the Internet industry.
  • Solid knowledge of computer science fundamentals, including the principles of operating systems, storage, and networking.
  • Software development experience in at least one programming language, such as Java/Go/C++/Python/JS.
  • Strong ability to troubleshoot system problems, good communication skills, and a sense of ownership.

Responsibilities

  • Design and develop solutions to automate the technical operations of large-scale systems, and work closely with teams to improve stability from a Software Development Lifecycle perspective.
  • Lead technical efforts to strengthen the stability of CapCut systems, including but not limited to monitoring, logging, dashboards, and diagnostic tools; conduct regular drills and develop remediation plans to achieve fast service restoration; and take shifts to respond to production issues across regions.
  • Define indicators for evaluating system performance and runtime behavior in order to improve observability and facilitate system development and troubleshooting; plan system capacity according to business expansion and scheduled promotions.

Preferred Qualifications

  • Experience with Redis, MySQL, Nginx, Kubernetes, Docker, etc. is a plus.