From cryptomarkets to the surface web: Scouting eBay for counterfeits

Felix Soldner, Fabian Plum, Bennett Kleinberg, Shane D Johnson
2024-06-07
Abstract: Detecting counterfeits on online marketplaces is challenging, and current methods struggle with the volume of sales on platforms like eBay, while cryptomarkets openly sell counterfeits. Leveraging information from 453 cryptomarket counterfeits, we automated a search for corresponding products on eBay, utilizing image and text similarity metrics. We collected data twice over 4 months to analyze changes, with an average of 159 eBay products per cryptomarket item, totaling 134k products. We found identical products, which would warrant further investigation as to whether they are counterfeits. Results indicate increasing difficulty finding similar products over time, moderated by product type and origin. Future improved versions of the current system could be used to examine possible connections between cryptomarket and surface web listings more closely and could hold practical value in supporting the detection of counterfeits on the surface web.
Social and Information Networks
What problem does this paper attempt to address?
The paper attempts to address the challenge of counterfeit goods detection in online markets. Specifically, the researchers focus on how to utilize information from cryptomarkets to automate the search and identification of counterfeit goods on surface web markets (such as eBay). Current methods struggle to handle the vast amount of sales data in large online markets like eBay, while cryptomarkets openly sell counterfeit goods. The researchers aim to improve the efficiency and accuracy of counterfeit goods detection by leveraging product information from cryptomarkets, combined with image and text similarity measures, to automatically search and identify related products on eBay.

### Main Issues:

1. **Challenges of Counterfeit Goods Detection**: The sales volume of counterfeit goods on online markets (such as eBay, Amazon) and social media platforms (such as Instagram, Facebook) is increasing, and existing detection methods are struggling to cope with this challenge.
2. **Utilization of Cryptomarkets**: Cryptomarkets (such as markets on the Tor network) openly sell counterfeit goods, and this information can be used to assist in the detection of counterfeit goods on surface web markets.
3. **Automated Search Methods**: How to utilize product information from cryptomarkets, combined with image and text similarity measures, to automatically search and identify counterfeit goods on surface web markets (such as eBay).

### Research Objectives:

- **Automated Search**: Develop an automated system that uses product information (name, description, images, etc.) from cryptomarkets to search for similar products on eBay.
- **Similarity Measures**: Determine the similarity between products on eBay and counterfeit goods on cryptomarkets through text and image similarity measures.
- **Support for Manual Inspection**: Filter out the most likely matches through automated methods, reducing the workload of manual inspection and improving detection efficiency.
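
To make the text side of the similarity objective concrete, here is a minimal sketch of ranking candidate listings by cosine similarity. It is an illustration only: the summary does not say how the authors vectorized text, so plain term counts are an assumption, and the function names `tokenize`, `cosine_similarity`, and `rank_candidates` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine similarity between term-count vectors of two texts.

    Assumption: raw term counts stand in for whatever text
    representation the original system used.
    """
    counts_a = Counter(tokenize(text_a))
    counts_b = Counter(tokenize(text_b))
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no meaningful direction
    return dot / (norm_a * norm_b)


def rank_candidates(query, candidates):
    """Return (score, candidate) pairs, most similar first."""
    scored = [(cosine_similarity(query, c), c) for c in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)
```

Calling `rank_candidates` with a cryptomarket product title as `query` and a list of eBay listing titles as `candidates` would put the closest titles at the top, which is the kind of shortlist a manual reviewer could then inspect.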

### Method Overview:

- **Data Collection**: Manually collected information on 453 counterfeit goods from 12 cryptomarkets and conducted two rounds of data collection on eBay, each collecting about 66,000 to 68,000 product listings.
- **Similarity Measures**: Calculated text and image similarity measures, including Word Mover's Distance, cosine similarity, color histogram comparison, feature detection and matching (such as SIFT, SURF, ORB), and custom Siamese neural networks.
- **Manual Annotation**: Recruited participants through a crowdsourcing platform to annotate 1,000 pairs of products to evaluate the effectiveness of the automated similarity measures.

### Significance:

- **Improved Detection Efficiency**: Automated methods can significantly reduce the workload of manual inspection and improve the efficiency of counterfeit goods detection.
- **Cross-Market Analysis**: The research results indicate that there may be a certain correlation between cryptomarkets and surface web markets, which helps to understand the circulation path of counterfeit goods.
- **Practical Application Value**: The system can provide technical support to law enforcement agencies and e-commerce platforms, helping them more effectively combat the sale of counterfeit goods.
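
Of the image measures named in the method overview, color histogram comparison is the simplest to illustrate. The sketch below is not the authors' implementation: the summary does not state which comparison metric or bin count they used, so histogram intersection and 4 bins per channel are assumptions, and images are represented as plain lists of `(r, g, b)` tuples to keep the example dependency-free. The names `color_histogram` and `histogram_intersection` are hypothetical.

```python
def color_histogram(pixels, bins=4):
    """Normalized joint RGB histogram of an image.

    pixels: iterable of (r, g, b) tuples with values in 0..255.
    bins:   bins per channel (assumed value, not from the source).
    """
    pixels = list(pixels)
    hist = [0.0] * (bins ** 3)
    for r, g, b in pixels:
        # Map each channel value 0..255 onto a bin index 0..bins-1.
        r_bin = r * bins // 256
        g_bin = g * bins // 256
        b_bin = b * bins // 256
        hist[(r_bin * bins + g_bin) * bins + b_bin] += 1
    if not pixels:
        return hist
    return [count / len(pixels) for count in hist]


def histogram_intersection(hist_a, hist_b):
    """Overlap of two normalized histograms: 1.0 identical, 0.0 disjoint."""
    return sum(min(a, b) for a, b in zip(hist_a, hist_b))
```

A histogram comparison like this ignores where colors appear in the image, so it is cheap but coarse; that limitation is one plausible reason to combine it with feature matching and learned measures, as the method overview lists.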