1 English character ≈ 0.3 tokens. 1 Chinese character ≈ 0.6 tokens.

While reading the DeepSeek pricing model, I came to the realization that China has an inherent advantage in AI thanks to its own language. The same language that at one point hindered the development of print production has now turned into the opposite (hehe, dialectics): a more efficient alternative.

Think of the first printing machines: you needed to arrange pieces, each carrying a single character, to form words. For Latin-alphabet languages this meant producing 26 uppercase pieces plus 26 lowercase pieces; for Chinese you needed thousands lol. For the same reason, Chinese typewriters were never mass-produced, and the ones that were built are ridiculously complex (see the MingKwai typewriter, recently found after being lost for decades).

Coming back to tokens: according to Google Gemini, the average English word is 4.7-5.1 characters long, while in Chinese a single character is a word by itself. This means an English word averages ~1.5 tokens while a Chinese word is ~0.6 tokens, more than twice as efficient. At scale this could be huge, possibly the difference between needing 10 data centers instead of 5.
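
A quick back-of-envelope sketch in Python of that arithmetic, using the 0.3 / 0.6 tokens-per-character rules of thumb above; the sample sentences and the per-character estimate are just illustrations, not real tokenizer output:

```python
# Rough token estimates from the per-character rules of thumb:
# ~0.3 tokens per English character, ~0.6 tokens per Chinese character.
EN_TOKENS_PER_CHAR = 0.3
ZH_TOKENS_PER_CHAR = 0.6

def estimate_tokens(text: str, tokens_per_char: float) -> float:
    """Character count times the per-character ratio (very rough)."""
    return len(text) * tokens_per_char

english = "The quick brown fox jumps over the lazy dog."
chinese = "敏捷的棕色狐狸跳过懒狗。"  # roughly the same sentence in Chinese

print(f"English: {len(english)} chars ≈ {estimate_tokens(english, EN_TOKENS_PER_CHAR):.1f} tokens")
print(f"Chinese: {len(chinese)} chars ≈ {estimate_tokens(chinese, ZH_TOKENS_PER_CHAR):.1f} tokens")
```

With these ratios the Chinese sentence comes out at roughly half the token count of the English one, which is the scale argument in a nutshell.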

So if you want to reduce your DeepSeek bill, get ready to learn Chinese, buddy.

  • 小莱卡OP

    It’s a topic that deserves a more in-depth study for sure. Some words can be on the longer end, like 爱沙尼亚 (Estonia), though they still tend to be shorter. Comparing the sizes of the same book in English and Chinese could give us a better approximation.
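
    A minimal sketch of that comparison, assuming you have the same book as plain text in both languages (book_en.txt / book_zh.txt are hypothetical file names) and reusing the per-character ratios from the post:

    ```python
    # Estimate tokens for parallel editions of the same book using the
    # rule-of-thumb ratios (~0.3 tokens/char English, ~0.6 tokens/char Chinese).
    # The file paths are hypothetical; swap in your own plain-text editions.
    EN_TOKENS_PER_CHAR = 0.3
    ZH_TOKENS_PER_CHAR = 0.6

    def estimated_tokens_from_file(path: str, tokens_per_char: float) -> float:
        with open(path, encoding="utf-8") as f:
            text = f.read()
        # Ignore whitespace so formatting differences don't skew the count.
        chars = sum(1 for c in text if not c.isspace())
        return chars * tokens_per_char

    en = estimated_tokens_from_file("book_en.txt", EN_TOKENS_PER_CHAR)
    zh = estimated_tokens_from_file("book_zh.txt", ZH_TOKENS_PER_CHAR)
    print(f"English edition ≈ {en:,.0f} tokens, Chinese edition ≈ {zh:,.0f} tokens")
    print(f"Chinese/English token ratio ≈ {zh / en:.2f}")
    ```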