Show HN: Catsu: A unified Python client for embedding APIs

Posted by bhavnicksm 12 hours ago

We open-sourced catsu, a Python client for embedding APIs.

The problem: every embedding provider has a different SDK with different bugs. OpenAI has undocumented token limits. VoyageAI's retry logic was broken until September. Cohere breaks downstream libraries every release. LiteLLM's embedding support is minimal.

catsu provides:

- One API for 11 providers (OpenAI, Voyage, Cohere, Jina, Mistral, Gemini, etc.)

- Bundled database of 50+ models with pricing, dimensions, and benchmark scores

- Built-in retry with exponential backoff

- Automatic cost tracking per request

- Full async support

Example:

```python

import catsu

client = catsu.Client()

response = client.embed(model="voyage-3", input="Hello!")

print(f"Cost: ${response.embeddings}")

print(f"Cost: ${response.usage.cost:.6f}")

```

GitHub: https://github.com/chonkie-inc/catsu

We built this at Chonkie (YC X25) because we needed it badly for our chunking library and to easily switch embeddings for our search product. Apache 2.0.

Comments

Comment by zerodayz 11 hours ago

this is huge, been wrangling with embedding libraries lately so will def try this out

Comment by bhavnicksm 11 hours ago

thanks! this is still pretty early, please let us know if you face any issues with the library, database or anything else :)

Comment by lennertjansen 11 hours ago

nice, this is an annoying problem. does it also provide fallback to switch providers when one isn't available?

Comment by bhavnicksm 11 hours ago

it doesn't right now, but the fallback feature is planned for in a future release. mostly because there's no simple way to handle the classic fallbacks like aws, gcp and azure, and we wanted to spend some time thinking about their DX.

Comment by citcho 10 hours ago

nice library, was looking for something like this for a while, thanks for building this