[Paper] RWKV: Reinventing RNNs for the Transformer Era

For me, the most interesting parts are Table 1 and Figure 7, which show hugely reduced demands for inference.

There are still open questions, of course, but if this develops into something genuinely equivalent to Transformers, well… that will be, as they say, huge if true.

The net result should be models with far lower VRAM requirements and much faster inference: RWKV generates as an RNN, carrying a fixed-size state from token to token instead of a KV cache that grows with context length.
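To make the memory argument concrete, here is a minimal sketch of the paper's WKV recurrence in plain NumPy. It is not the official implementation: the numerical-stability trick the paper uses (shifting exponents by a running maximum) is omitted, and all names and sizes here are illustrative. The point is that the per-layer state is two `d`-dimensional vectors no matter how many tokens have been processed.

```python
# Minimal sketch of RWKV's WKV recurrence (stability tricks omitted).
# The recurrent state is two d-vectors, so memory is O(d) per layer,
# independent of how many tokens have been generated.
import numpy as np

def wkv_step(k_t, v_t, state, w, u):
    """One token of the WKV recurrence.

    state = (a, b): decayed sums of past e^{k_i} v_i and e^{k_i}.
    """
    a, b = state
    # Mix the accumulated past with the current token, which
    # receives the per-channel bonus u.
    wkv = (a + np.exp(u + k_t) * v_t) / (b + np.exp(u + k_t))
    # Decay the past by e^{-w} and fold in the current token.
    a = np.exp(-w) * a + np.exp(k_t) * v_t
    b = np.exp(-w) * b + np.exp(k_t)
    return wkv, (a, b)

d = 8                      # channel dimension (toy size)
w = np.full(d, 0.5)        # learned per-channel decay (illustrative value)
u = np.zeros(d)            # learned per-channel bonus for the current token
state = (np.zeros(d), np.zeros(d))

rng = np.random.default_rng(0)
for t in range(1000):
    k_t, v_t = rng.normal(size=d), rng.normal(size=d)
    out, state = wkv_step(k_t, v_t, state, w, u)
# After 1000 tokens the state is still just two d-vectors, whereas a
# Transformer's KV cache would hold 1000 * 2 * d values per layer.
```

The paper's actual CUDA kernels compute the same quantity with an exponent-shifting trick to avoid overflow, but the memory picture is the same.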

That would have two immediate consequences:

  1. It would become much easier to self-host larger, more powerful models.
  2. Even larger, more powerful models would cost much less to run at scale, pushing API prices down.

Imagine GPT-4 API calls at 1/4 the cost of GPT-3.5-Turbo…
