arxiv:2310.16944
Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
upvoted an article about 3 hours ago
Is it agentic enough? Benchmarking open models on your own tooling published an article about 18 hours ago
Is it agentic enough? Benchmarking open models on your own tooling liked a model 4 days ago
nex-agi/Nex-N2-Pro