The Media Cloud project (http://mediacloud.org) is seeking a Senior Data Engineer to develop scalable text analysis pipelines, research and implement cutting-edge text classification approaches, and support and collaborate on academic research projects related to media attention, hate speech, and social media platforms. The Media Cloud platform is a set of online tools, and associated research methods, for monitoring and measuring online media.
In this grant-funded role, you will wear many hats - exploratory data scientist, text analysis expert, data pipeline engineer, research collaborator, product manager, and more. You will work closely with the principal investigators and a team of media researchers to research, prototype, and develop data analysis workflows that can scale from initial prototypes to corpora of millions of documents. Some of this will rely on skills you already have, but you will have to do significant work learning new skills and exploring cutting-edge supporting technologies and algorithms. This position provides an opportunity for someone to work on leading tools that support critical research into how social mobilization interacts with media and to help make Media Cloud more useful for researchers and non-profits trying to understand the role of media for democratic processes. We expect scholarly and popular press publications to come out of this research.
Given the conditions created by the ongoing pandemic, this position is open to part-time remote status. However, it does require being on site at Northeastern at regular intervals.
Primary Duties and Responsibilities
The Media Cloud project (http://mediacloud.org) is seeking a Senior Data Engineer to develop scalable text analysis pipelines, research and implement cutting-edge text classification approaches, and support and collaborate on academic…