OpenSea is on a mission to build an open digital economy, helping the world’s creators, collectors and collaborators own and shape their relationships directly. As the first and largest marketplace for non-fungible tokens (NFTs), customers can browse, buy, sell, and mint NFTs across seven different blockchains on the OpenSea platform.
Trust and safety are some of the most significant barriers to welcoming new people into the Web3 ecosystem. While Web3 and NFT technology are starting to mature, users new to the world of NFTs are still often deceived by fake content and need help properly distinguishing between original NFTs and copymints (or NFTs that are duplicates or imitations of popular NFTs). As a leader in the space, OpenSea was looking for a vendor to help advance their detection and removal capabilities to identify and mitigate copymints and fraud as early as possible.
“One of the most important top-level company objectives our team works on is ensuring that OpenSea stays the safest and most trusted destination for users looking to discover and purchase NFTs. Our work with Scale is key to accomplishing this objective, particularly around ensuring that users are not misled or deceived into purchasing inauthentic items.”
Product Manager, OpenSea
OpenSea needed an industry-leading solution that could identify and handle a dynamic set of deceptive NFTs. Before working with Scale, the OpenSea team was early in their AI journey. Though the team already used rule-based systems to help capture forms of deception, it was a challenge to attain the desired speed, recall, and precision needed to effectively address fraud in the marketplace. OpenSea approached Scale due to the team’s experience building customized models and ability to ramp quickly for customers.
Scale Content Understanding provides data enhancement for better platform experiences by enriching, analyzing, and categorizing content. Here, Scale Content Understanding proved robust in testing against OpenSea’s high data volume of up to 50 million items a week and proved an effective partner for tackling their problems.
With Scale Content Understanding, the team offered a fast turnaround models-as-a-service solution to categorize and determine whether a given NFT is a close match of another one. Scale’s ML models provided multiple layers of deduplication processes: starting with real-time detection through API-based solutions and full catalog scans to remove historical scams.
Real-Time NFT Detection
When an NFT is minted, OpenSea needs to quickly detect whether or not it is a copymint and remove it from its site to mitigate the chance of a user purchasing that copyminted item. Scale Content Understanding facilitates this through two ways: 1) a real-time API and 2) a recurring batch job:
In order to satisfy the scale and complexity of the problem, the Scale team trained custom deep learning image models to represent NFTs as embeddings in a manifold. With this setup, similar items will be nearby, meaning copymints would sit near each other, and items that are significantly different will have a further distance from one another. Scale converted and stored all these NFT embeddings in a vector database and built out systems for real-time querying and retrieval through k-Nearest Neighbors algorithms.
“The real-time detection of fuzzy matches is really tricky for systems to get right, but I think Scale’s models have really nailed it.” - Charles Zaffaroni, Product Manager, OpenSea
Full Catalog Scans
Over time, OpenSea's operations team verifies more collections on their platform, and Scale's engineering team improves the model’s performance based on this feedback. These improvements are retroactively applied to the complete set of items to take down any previously missed fraudulent items. Scale Content Understanding provides the capability to run full catalog scans, scanning hundreds of millions of items with high precision over a short period of time.
“We saw Scale as an established leader within the space with a track record of success. We had confidence from the get-go that Scale would deliver on our needs and be a good partner in the space. Scale was willing to experiment in new fields and develop new technology required for our systems and infrastructure to help us evolve.”
Product Manager, OpenSea
After kicking off the project with Scale Content Understanding, OpenSea was able to dramatically accelerate their copymint detection capabilities – today detecting copyminted NFTs in real time. Scale provided model-based systems to help process large volumes of data and provide signals to the OpenSea team. One of the critical measures of success for OpenSea was reducing the latency between an NFT being created on the platform to identifying bad content and taking it down.
The second measure of success is the ability to handle the large volumes of data from OpenSea. Scale processes up to 50 million items a week with 95% average precision. By quickly detecting and removing inauthentic NFTs, OpenSea is able to improve user trust in their marketplace. Here’s to a safer OpenSea, and thus a safer web3.
“We have a lot of different systems that try to detect malicious behavior. In terms of speed of deployment, Scale helped us achieve our goal of detecting copymints in under 30 seconds. The Scale team helped us make a material difference in our copy detection effort.”
Data Engineer, Trust and Safety, OpenSea