OpenAI, the company behind the chatbot ChatGPT, has ramped up its hiring around the world, bringing on roughly 1,000 remote contractors over the past six months in regions like Latin America and Eastern Europe, according to people familiar with the matter.
About 60% of the contractors were hired to do what’s called “data labeling” — creating massive sets of images, audio clips, and other information that can then be used to train artificial intelligence tools or autonomous vehicles.
The other 40% are computer programmers who are creating data for OpenAI’s models to learn software engineering tasks. OpenAI’s existing Codex product, launched in Aug. 2021, is designed to translate natural language into code.
“A well-established company, which is determined to provide world-class AI technology to make the world a better and more efficient place, is looking for a Python Developer,” reads one OpenAI job listing in Spanish, which was posted by an outsourcing agency.
Previously, OpenAI trained its models on code scraped from GitHub, a repository site owned by its largest investor, Microsoft, which last week confirmed a new multibillion-dollar investment first reported by Semafor. But in this case, OpenAI appears to be building a dataset that includes not just lines of code, but also the human explanations behind them written in natural language.
A software developer in South America who completed a five-hour unpaid coding test for OpenAI told Semafor he was asked to tackle a series of two-part assignments. First, he was given a coding problem and asked to explain in written English how he would approach it. Then, the developer was asked to provide a solution. If he found a bug, OpenAI told him to detail what the problem was and how it should be corrected, instead of simply fixing it.
“They most likely want to feed this model with a very specific kind of training data, where the human provides a step-by-step layout of their thought-process,” said the developer, who asked to remain anonymous to avoid jeopardizing future work opportunities. He has not yet been hired or rejected by OpenAI.
Sam Altman, OpenAI’s CEO, recently put the company’s headcount at 375 people, a tiny number compared to the thousands of staff at tech giants like Google and Facebook working on artificial intelligence. “I know I’m not supposed to brag about OpenAI,” he tweeted, touting the company’s “talent density.”
Altman did not appear to include contractors in that figure. But in OpenAI’s published research, the company has repeatedly noted the importance of outsourced labor in building its technology. “Finally, we’d like to thank all of our contractors for providing the data that was essential for training the models,” a team of researchers at OpenAI wrote in a paper last year.
OpenAI declined to comment on its hiring practices.