Datasaur, an organization constructing a textual content data-labeling platform, at this time introduced it has raised a $1 million seed spherical from angel buyers like Segment CTO Calvin French-Owen. Coming out of stealth at this time, Datasaur was based in February 2019 and makes use of semi-automated labeling and a few pretrained fashions to hurry up the data-labeling course of and gasoline the advance of pure language processing (NLP) fashions.
Datasaur was based by Ivan Lee, who has spent the previous seven years working as a product supervisor on AI ventures at firms like Yahoo, most just lately for Apple’s Siri workforce. Before working at Apple, Lee sold mobile gaming startup Loki Studios to Yahoo in 2013.
As a part of the Winter 2020 batch, Datasaur will current subsequent month at Y Combinator’s Demo Day in San Francisco.
“As a PM, I came to appreciate just how powerful AI was, but I also recognized that I was constantly trying to get more labeled data for my engineers. It was this insatiable appetite. We were spending millions of dollars gathering this data, but it was a tedious job, it was an inefficient process, and I saw a lot of these companies reinventing the wheel when it came to how they should set up their labeling processes,” Lee instructed VentureBeat in a cellphone interview.
The funding will go towards launching the Datasaur NLP platform, which was in closed beta till at this time, and including performance that helps managers do issues like delegate assignments or detect bias in information units.
Early Datasaur customers embrace companies, teachers, and researchers working with the Indonesian authorities to flag on-line information articles and guard in opposition to election tampering.
Datasaur goes up in opposition to plenty of data-labeling startups, like Labelbox, which final month raised $25 million, and CloudFactory, which raised $65 million final fall.
Above: Datasaur person interface
But Lee expects Datasaur will have the ability to compete by focusing solely on software program for labeling textual content information. He believes firms in any business hungry for insights from textual content information will more and more discover data-labeling instruments important.
“We’re seeing a lot of companies [that] need to set up their own labeling processes, and so we want to help bring them the same efficiencies that any of these other services have been able to build,” he mentioned.
Datasaur presently has 10 staff in Sunnyvale, California and Indonesia.