Systems & Infrastructure Specialist
We are looking for a Systems & Infrastructure Specialist with experience in containerized environments and DevOps to work remotely, paid hourly in USD. Advanced English is required.
Job description
DIRECT LINK TO THE INTERVIEW: https://tinyurl.com/Ref-Systems-Infrastructure-AI
FULL LIST OF OPEN POSITIONS: https://tinyurl.com/FullJobsList
We are hiring a Systems & Infrastructure Specialist: remote position, pay between $40 and $70 USD/hr. The interview and the work are conducted in English. Ideal for candidates with experience in systems, DevOps, infrastructure, and containerized environments.
Systems & Infrastructure Specialist
We are seeking Systems & Infrastructure Specialists to support AI training initiatives through infrastructure management, containerized environments, and systems-level problem solving.
Responsibilities
- Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes using command-line tools
- Manage containerized environments (Docker, CI/CD pipelines, sandboxed systems)
- Build, maintain, and optimize systems for AI training and compute workloads
- Respond to system failures with rapid debugging and recovery strategies
- Collaborate with engineering and AI teams on reliability and performance
- Document system architecture, incidents, and recovery procedures
- Adapt to evolving infrastructure and tooling requirements
Requirements
- Strong experience in terminal-based system administration and infrastructure work
- Advanced problem-solving in multi-step debugging and system recovery
- Experience with Docker and containerized environments
- Proficiency in at least one systems language (Python, Bash, Go, Rust, C/C++)
- Familiarity with build systems, package managers, CI/CD, and distributed systems
- Strong written and verbal communication skills
- Experience working in remote, fast-paced technical environments
Nice to have
- Experience in SRE or DevOps roles
- Exposure to high-compute or AI/ML infrastructure environments
- Knowledge of orchestration tools and scalable system design
Targeted at: Systems & Infrastructure Specialists / Infrastructure Engineers / Systems Engineers / DevOps Engineers / Cloud Engineers / Site Reliability Engineers / Platform Engineers / Network Engineers / IT Infrastructure Specialists / Systems Administrators / Backend Infrastructure Engineers / Virtualization Engineers / Kubernetes Engineers / Linux System Engineers / Data Center Engineers / Infrastructure Security Engineers / Infrastructure Automation Engineers / IT Operations Engineers / Scalability Engineers / Systems Support Engineers
Skills & areas: Systems infrastructure / Linux / Windows Server / Cloud computing / AWS / Azure / Google Cloud / DevOps / SRE / Kubernetes / Docker / CI/CD / Networking / Virtualization / VMware / Terraform / Ansible / Infrastructure as Code / Monitoring / Logging / Observability / System administration / Performance tuning / Incident response / High availability / Disaster recovery / Infrastructure security / Automation / Problem-solving / Remote collaboration