Buscar

Oportunidad de empleo

Senior Data Engineer

The Media Cloud project (http://mediacloud.org) is seeking a Senior Data Engineer to develop scalable text analysis pipelines, research and implement cutting-edge text classification approaches, and support and collaborate on academic research projects related to media attention, hate speech, and social media platforms. The Media Cloud platform is a set of online tools, and associated research methods, for monitoring and measuring online media.


In this grant-funded role, you will wear many hats - exploratory data scientist, text analysis expert, data pipeline engineer, research collaborator, product manager, and more. You will work closely with the principal investigators and a team of media researchers to research, prototype, and develop data analysis workflows that can scale from initial prototypes to corpora of millions of documents. Some of this will rely on skills you already have, but you will have to do significant work learning new skills and exploring cutting-edge supporting technologies and algorithms. This position provides an opportunity for someone to work on leading tools that support critical research into how social mobilization interacts with media and to help make Media Cloud more useful for researchers and non-profits trying to understand the role of media for democratic processes. We expect scholarly and popular press publications to come out of this research.

Given the conditions created by the ongoing pandemic, this position is open to part-time remote status. However, it does require being on site at Northeastern at regular intervals.

Primary Duties and Responsibilities

  • Keep up to date on research in data analysis architectures, text classification, hate speech detection, social media platform policies, machine learning, etc. to inform new functionalities in the tooling and research output.
  • Work with other team members to establish a technical vision for the project.
  • Contribute to research papers with planning, writing, and data needs.
  • Maintain, upgrade, and build new data pipelines with data from existing corpora, APIs, and other sources.
  • Write code that can scale systems to handle ever-expanding data requirements.
  • Engage in active collaboration and coordination with the cross-institution research team.
  • Contribute to related project data needs as needed.
  • Provide budget, logistical, and HR inputs to support grant management.
  • Contribute to a healthy remote workplace and cultures.

Qualifications:

Required:

  • College degree or other domain-specific accreditation, preferably in computer science, data science, or related fields.
  • 2+ years of experience with cross functional engineering teams.
  • 5+ experience working with software and data in some capacity.
  • Programming fluency in Python and data-related libraries (pandas, Jupyter notebooks, etc).
  • Demonstrated ability to iterate quickly through prototypes.
  • Knowledge of large scale data collection, processing, and storing systems.
  • Ability to work productively in a virtual environment with remote team members all over the world.
  • Interest in working on issues related to media coverage and hate-speech, democracy, gender, race, or health.

Preferred:

  • Master’s degree or other domain-specific accreditation, preferably in computer science or data science related fields.
  • Prior experience as a senior software engineer or product development leadership.
  • Hands-on experience with complex technical project management.
  • Prior work with online media ingestion and storage.
  • Interest in working in an academic research environment on projects with real-world impact.

The Media Cloud project (http://mediacloud.org) is seeking a Senior Data Engineer to develop scalable text analysis pipelines, research and implement cutting-edge text classification approaches, and support and collaborate on academic…

Detalles a Simple Vista

  • Flexibilidad
    A Tiempo Completo
  • Fecha de inicio
    9 de mayo de 2022
  • Nivel de Experiencia
    Mando intermedio
Salario
USD $80.000 - USD $110.000
/ año

Nivel de Idiomas

English required

English required

Ubicación

A distancia
El trabajo se puede realizar en cualquier lugar en Estados Unidos
Ubicación Asociada
360 Huntington Avenue, Boston, MA 02115, United States
100 Meserve Hall

Únete a Idealist

Regístrate hoy para acceder a tus empleos favoritas y recibe alertas por email cuando se publiquen nuevas oportunidades.