Job Opportunities at Facebook

I have several openings at Facebook in Palo Alto, CA. If interested, please send your resume to Michelle Bostock (mbostock-at-facebook.com). Thank you.

Site Reliability Engineer
Palo Alto, CA

Description
Facebook is seeking talented operations engineers to join the Site Reliability Engineering team. The ideal candidate will have strong communication skills, a passion for tinkering with Linux, and an almost insane fondness for fast-paced, seat-of-your-pants troubleshooting and crisis management. The position is full-time and is based in our main office in downtown Palo Alto. This position reports to the Manager of Site Reliability Engineering.

Responsibilities

* Monitor the stability and performance of the website
* Remotely troubleshoot and diagnose hardware problems
* Debug issues with Linux software, applications and network
* Resolve technical challenges encountered in LAMP technologies
* Develop and maintain monitoring tools and automation systems
* Predict and respond to utilization variances across multiple datacenters
* Identify and triage all outage related events
* Facilitate communication, coordinate escalation, and work with subject matter experts to implement critical fixes
* Automate and streamline processes
* Track issues and run reports

Requirements

* 2-3 years+ Linux support/sys admin experience in an Internet operations environment
* BA/BS in Computer Science or a related field, or equivalent experience
* Working knowledge of Linux, Cisco, TCP/IP, Apache and mySQL
* Experience working with network management systems and monitoring tools, such as Nagios, Ganglia and Cacti
* Competency in Shell, PHP, Perl or Python. C is a plus
* Solid understanding of web services architecture and commonly employed technologies
* A sense of urgency in responding to and resolving critical issues that relate to the performance of the site and/or core infrastructure
* Excellent verbal and written communication skills
* Participation in a shifted coverage schedule, including working nights and on-call rotations

Systems Architect
Palo Alto, CA

Description
Facebook is seeking a seasoned Systems Architect to join the Operations team. The position is full-time and is based in our main office in downtown Palo Alto and will report to the Manager of Systems Operations.

Responsibilities

* Analyze application flow and infrastructure design to improve performance and scalability of the site
* Collaborate on design of services infrastructure from servers to networking
* Monitor, analyze, and make recommendations as appropriate to improve site stability and availability
* Evaluate hardware and software technologies to improve site efficiency and performance
* Troubleshoot and solve issues with hardware, applications, and network components
* Lead team efforts from design to implementation, prioritize tasks and resources while interacting with Engineering and Operations
* Document current and future configuration processes and policies
* Participate in 24x7 on-call support

Requirements

* B.S. in Computer Science or equivalent experience
* 4+ years of experience in Operations with large web farms
* Extensive knowledge of web architecture and technologies, including Linux, Apache, MySQL, PHP, TCP/IP, security, HTTP, LDAP and MTAs
* Strong background/interest in application and infrastructure design
* Scripting and programming skills
* Excellent verbal and written communication skills 

Data Center Engineer
Palo Alto, CA

Description
Facebook is seeking a Data Center Engineer to join the Site Operations Team. This position is full-time and will be based in one of our Bay Area data center facilities.

Responsibilities

* Install new servers, switches, routers, and storage hardware
* Assist in coordination of data center tasks and resolving machine repairs tickets
* Understand and debug network related issues on network hardware as well as the OS
* Resolve technical challenges of managing servers in multiple geographical locations
* Develop and maintain server automation and installation tools
* Create documentation to streamline and improve upon data center best practices
* Identify opportunities for process improvement, plan, and implement changes.

Requirements

* Experience in data center deployment and building scaling infrastructure
* Working knowledge of TCP/IP, security, HTTP, LDAP, and LAMP
* Ability to script using one of the following: PHP, Python, or Bash
* Knowledgeable in Data center practices (i.e. cable routing, calculating power usage)
* Certification such as CCNA, RHCT or equivalent experience
* Experience working with server/network hardware and Linux OS
* Excellent verbal and written communication skills
* Ability to lift/move 20-30 lbs on a daily basis
* Off-hours coverage.


Hmm, if I wasn’t ‘tied down’ here in Norway I’d jump on the first plane and apply! 8)