Site Reliability Engineer – CapCut
Company | ByteDance |
---|---|
Location | San Jose, CA, USA |
Salary | Not Provided |
Type | Full-Time |
Degrees | Bachelor’s |
Experience Level | Mid Level, Senior |
Requirements
- Bachelor’s degree or higher in Computer Science or a related technical discipline.
- 2-5 years of work experience in the Internet industry.
- Solid knowledge of computer science fundamentals, and familiarity with the principles of operating systems, storage, and networking.
- Software development experience in at least one programming language, such as Java, Go, C++, Python, or JavaScript.
- Strong ability to resolve system problems, good communication skills, and a sense of ownership.
Responsibilities
- Design and develop solutions to automate the technical operations of large-scale systems, and work closely with teams to improve stability from a Software Development Lifecycle perspective.
- Drive technical efforts to strengthen the stability of CapCut systems, including but not limited to monitoring, logging, dashboards, and diagnostic tools; conduct regular drills and develop remediation plans to achieve fast service restoration; and take shifts to respond to production issues across regions.
- Define indicators that evaluate system performance and runtime behavior, improving observability and supporting system development and troubleshooting; plan system capacity according to business expansion and scheduled promotions.
Preferred Qualifications
- Experience with Redis, MySQL, Nginx, Kubernetes, Docker, etc. is a plus.