Youtube视频搜索

1. requirements

  • What is the primary (business) objective of the search system?

  • What are the specific use cases and scenarios where it will be applied?

  • What are the system requirements (such as response time, accuracy, scalability, and integration with existing systems or platforms)?

  • What is the expected scale of the system in terms of data and user interactions?

  • Is their any data available? What format?

  • Can we use video metadata? Yes

  • Do we need consider Personalization? not required

  • How many languages needs to be supported?

2. pipeline

3. data collection

4. feature

5. model

text

video

loss

6. Evaluation

  • Offline

    • Precision@k, mAP, Recall@k, MRR

    • we choose MRR (avg rank of first relevant element in results) due to the format of our eval data <video, text> pair

  • Online

    • CTR: problem: doesn't track relevancy, click baits

    • video completion rate: partially watched videos might still found relevant by user

    • total watch time

    • we choose total watch time: good indicator of relevance

7. deployment and prediction service

  • A/B test

  • Scaling

8. monitoring and maintenance

reference

Last updated