Jyh2eh GitHub - Search News

News

QwenLM/Qwen2.5-Math: A series of math-specific large language models of our Qwen2 series. - GitHub

We evaluate our Qwen2.5-Math base models on three widely used English math benchmarks GSM8K, Math, and MMLU-STEM. In addition, we also evaluate three Chinese math benchmarks CMATH, GaoKao Math Cloze, ...

GitHub18h

GitHub - magnitudedev/magnitude: The AI browser automation framework

Problem #1: Most browser agents draw numbered boxes around page elements - doesn't generalize well due to complex modern sites. Solution: Vision-first architecture. Visually grounded LLM specifies ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

News

Trending now