scale.fun

scale.fun: the on-chain launchpad for datasets, powered by real zk-verified humans

scale.fun

Created At

ETHGlobal Buenos Aires

Winner of

Self Protocol

Self - Best Self onchain SDK integration 1st place

Project Description

scale.fun connects AI Labs needing labeled training data with a global, crypto-incentivized workforce on mobile.

šŸ“± AI Labs upload raw datasets (images, audio, text) and set bounties in cUSD. Workers complete micro-labeling tasks on their phones through MiniPay and get paid instantly. All labeled datasets are stored on Filecoin and sold on our decentralized marketplace.

Key features: šŸ” ZK-verified identity via Self Protocol (age/country verification for region-specific data labeling) šŸ’° Instant cUSD micropayments on Celo for each completed annotation šŸ“¦ Filecoin storage for immutable dataset archiving šŸ›’ Decentralized marketplace where AI Labs can buy/sell verified labeled datasets šŸŒ Mobile-first design optimized for MiniPay users in emerging markets like Africa, Latin America, India šŸ‡æšŸ‡¦šŸ‡¦šŸ‡·šŸ‡®šŸ‡³

This solves the massive bottleneck in AI development: getting high-quality, ethically-sourced training data at scale.

How it's Made

Built as a Celo MiniApp using Next.js 14, optimized for MiniPay mobile wallets in emerging markets.

āš™ļø Smart Contracts (Solidity):

  • DataAnnotateEscrow: Manages worker bounties and payouts in cUSD
  • DatasetMarketplace: Handles dataset sales with 15% platform fee
  • Deployed on Celo Mainnet (42220)

šŸ“¦ Filecoin Integration:

  • Synapse SDK for uploading labeled datasets to Filecoin Calibration Testnet
  • Fast retrieval API for buyers to download purchased datasets
  • Stores both raw files and annotation JSON with content-addressed CIDs
  • Background upload queue to handle large dataset batches

šŸ” Self Protocol (ZK Verification):

  • Zero-knowledge proofs for age/country verification without exposing PII
  • Enables task gating for location and age-specific data labeling requirements
  • Stored on-chain for trustless verification

šŸ› ļø Tech Stack:

  • Frontend: Next.js 14, TailwindCSS, Wagmi/Viem for wallet connections
  • Backend: Supabase (PostgreSQL) for user data, annotations, and marketplace listings
  • Storage: Filecoin Calibration Testnet via Synapse SDK
  • Blockchain: Celo Mainnet (real cUSD payments)

šŸ’” Hacky Bits:

  • Mixed testnet/mainnet: Self Protocol on testnet, but marketplace on mainnet
  • Background Filecoin uploads: Built queued API for 60-90s uploads
  • Mobile-first download: Custom component for MiniPay WebView constraints
  • Grouped dataset logic: Multiple files share one on_chain_dataset_id but separate CIDs
background image mobile

Join the mailing list

Get the latest news and updates