通过对比简历的数据模式,发现GitHub项目xzfan/data-import(改项目已被删除)疑似为收集这些简历数据的爬虫。该爬虫会收集来自58同城等各个求职平台的简历。58同城否认数据泄漏来自他们,认为是第三方爬虫泄漏了简历数据:
We have searched all over the database of us and investigated all the other storage, turned out that the sample data is not leaked from us. It seems that the data is leaked from a third party who scrape data from many CV websites.
目前,泄漏数据的MongoDB数据库已经无法访问,但是通过日志发现,有数十个IP曾经访问过该数据库,这意味着简历数据可能已经泄漏。
本文素材来自互联网