EventFrontier
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
cm0002@lemmy.world to Machine Learning | Artificial Intelligence@lemmy.worldEnglish · 3 months ago

Jan v1: 4B open model for web search with 91% SimpleQA, slightly outperforms Perplexity Pro

arxiv.org

external-link
message-square
0
fedilink
5
external-link

Jan v1: 4B open model for web search with 91% SimpleQA, slightly outperforms Perplexity Pro

arxiv.org

cm0002@lemmy.world to Machine Learning | Artificial Intelligence@lemmy.worldEnglish · 3 months ago
message-square
0
fedilink
Lucy: edgerunning agentic web search on mobile with machine generated task vectors
arxiv.org
external-link
Small language models (SLMs) are inherently limited in knowledge-intensive tasks due to their constrained capacity. While test-time computation offers a path to enhanced performance, most approaches treat reasoning as a fixed or heuristic process. In this work, we propose a new paradigm: viewing the model's internal reasoning, delimited by and tags, as a dynamic task vector machine. Rather than treating the content inside these tags as a mere trace of thought, we interpret the generation process itself as a mechanism through which the model \textbf{constructs and refines its own task vectors} on the fly. We developed a method to optimize this dynamic task vector machine through RLVR and successfully trained an agentic web-search model. We present Lucy, a 1.7B-parameter SLM that leverages this dynamic reasoning mechanism with MCP integration to achieve 78.3% accuracy on the SimpleQA benchmark, performing on par with much larger models such as DeepSeek-V3. This demonstrates that small models can rival large ones when equipped with structured, self-constructed task reasoning.
alert-triangle
You must log in or register to comment.

Machine Learning | Artificial Intelligence@lemmy.world

machinelearning@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Welcome to Machine Learning – a versatile digital hub where Artificial Intelligence enthusiasts unite. From news flashes and coding tutorials to ML-themed humor, our community covers the gamut of machine learning topics. Regardless of whether you’re an AI expert, a budding programmer, or simply curious about the field, this is your space to share, learn, and connect over all things machine learning. Let’s weave algorithms and spark innovation together.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 17 users / week
  • 17 users / month
  • 42 users / 6 months
  • 1 local subscriber
  • 1.13K subscribers
  • 59 Posts
  • 24 Comments
  • Modlog
  • mods:
  • Hopps@lemmy.world
  • BE: 0.19.8
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org