Skip to main content

LLM Models

Tutorials
Chinese LLMs
Code LLMs
Evaluation/Monitor
  • PromptBench: A Unified Library for Evaluating and Understanding Large Language Models.
  • AI產品與系統評測中心: AI評測模擬測試題庫.xlsx
  • Opik is an open-source platform for evaluating, testing and monitoring LLM applications.
Function Calling LLMs
Content Safty
  • Google ShieldGemma
    ShieldGemma則是個安全分類模型,可額外部署在模型的輸入及輸出端,用以過濾有害內容,它主要篩選4大領域的內容,包括仇恨言論、騷擾、裸露的色情內容,以及危險內容。
Calculate VRAM required for LLM