DocumentCloud has an immediate opening for a contract Platform Engineer! If you’d enjoy a chance to help develop the next generation of our service — an open-source civic platform that more than 1,500 news organizations use to analyze, annotate and publish documents for the public good — we’d love to hear from you.
This is a contract position funded by a grant from the Knight Foundation. We’re a nimble, tightly knit team that works remotely — we stay connected via Slack and video chat.Hours are flexible.
You’ll work on developing DocumentCloud’s processing pipeline, which makes searching and analyzing document collections accessible to journalists, to improve DocumentCloud’s extraction and analysis capabilities. The pipeline consists of several open source tools wrapped in our Ruby-based infrastructure (a Rails-driven API and our CloudCrowd parallel processing toolkit). You’ll also play a key role in developing our production API capabilities, especially focused around what information we extract for users from documents and how best to do so.
Our ideal candidate would have the following skills and qualities:
— Independent problem-solver who values learning, keeps current on trends, and knows how to pick the right set of tools for a problem.
— Able to write clean, well-documented code; experience with Git and GitHub.
— Strong ability to collaborate and communicate with a distributed team.
— Ruby and Rails.
— Experience with Unix-based systems, Amazon Web Services and production environments.
— Knowledge of SQL (Postgres preferred).
Bonuses:
— Some knowledge of data science, linguistics, information extraction or search. SOLR experience is a bonus.
— An interest in language and data processing.
You’ll join DocumentCloud at a significant time. We’re enjoying widespread use of our platform, and our tools are used in some of the best journalism being published. You’ll have the chance to be part of the community exploring the intersection of news, data and technology.
To apply, please contact us at jobs@documentcloud.org and include a resume and code samples via GitHub.