Settings

Theme

Ask HN: What are some unusual but useful Python libraries you've discovered?

38 points by alexliu518 a year ago · 18 comments · 1 min read


Hey everyone! I'm always on the lookout for new and interesting Python libraries that might not be well-known but are incredibly useful. Recently, I stumbled upon Rich for beautiful console output and Pydantic for data validation, which have been game-changers for my projects. What are some of the lesser-known libraries you've discovered that you think more people should know about? Share your favorites and how you use them!

networked a year ago

Last year I posted a similar Ask HN; it got 11 comments: https://news.ycombinator.com/item?id=38505531. I asked, "What lesser-known Python libraries do you wish people knew about?" The suggestions there are worth looking up. Don't miss DiskCache (https://github.com/grantjenks/python-diskcache).

I really like xmltodict (https://github.com/martinblech/xmltodict). Despite the name, it works in both directions. It is the most ergonomic library I have used for creating XML. It has external type stubs: https://pypi.org/project/types-xmltodict/.

Since you have recently discovered Rich, you may want rich-argparse (https://github.com/hamdanal/rich-argparse). It colorizes argparse CLIs with little effort from the user.

DeepDiff (https://github.com/seperman/deepdiff) has helped me with testing. I needed to compare two nested data structures but ignore any differences in floats (timestamps). DeepDiff let me do it:

  diff = DeepDiff(
     run_session(config), run_session(config, force=True), exclude_types=(float,)
  )

  assert not diff
pzp (https://github.com/andreax79/pzp) is like fzf in pure Python to use in your programs. Keep in mind it is currently version 0.0.x. I have found bugs, but I think it is just cool that it exists.
Crier1002 a year ago

I use https://github.com/litl/backoff for configurable backoff + retry

context: OpenAI API used to be super flaky back in the early days, i needed to retry my requests quite frequently and i found this

  • networked a year ago

    Backoff doesn't look bad.

    I have used two libraries for retrying in Python: retry (https://github.com/invl/retry) and Tenacity (https://github.com/jd/tenacity). Tenacity actually helped me with a recent GPT-4o mini experiment, when OpenRouter gave me an error after a couple hundred requests. I can recommend Tenacity. A downside is that the API is fairly complex and verbose. You will probably look it up each time if you don't use it often (or rely on an IDE or LLM).

    Backoff seems somewhat less flexible than Tenacity (https://github.com/litl/backoff/issues/125) but more concise. Basic use requires as little code as retry and should be easy to remember.

    • ddorian43 a year ago

      "stamina" is a wrapper for tenacity with nice defaults

      • networked a year ago

        I didn't know about stamina. I like it a lot at first glance. Thanks for telling me.

        Compared to Backoff, it has docs, and you don't need to tell it each time you want exponential backoff. (Exponential backoff is what you usually want when you don't want a fixed delay.) You can use Tenacity directly when you need something more complex.

        https://github.com/hynek/stamina

srhtftw a year ago

Happy user of Andrew Moffat's https://sh.readthedocs.io for over a decade.

jmkr a year ago

For audio analysis:

https://librosa.org/

Audio track seperation:

https://github.com/adefossez/demucs

demucs works pretty well.

KolenCh a year ago

Defopt generates cli from function interface: https://defopt.readthedocs.io/en/stable/features.html

The alternative is either you maintain two interfaces with boilerplate, or write a cli only if that’s the first priority.

Similar solutions exist, like fire. But fire’s cli is like an afterthought, in the sense it gives people a way to run things in command line where they already know how to run it from Python.

hcfman a year ago

Opensoundscape for sound localization

https://opensoundscape.org/en/latest/

vismit2000 a year ago

This has been an absolute game changer: https://github.com/pytries/marisa-trie (succinct trie with fast lookup and minimal RAM) Use case and more details in the wonderful book by Ian and Micha: High Performance Python

skykery a year ago

Exporting datasets - Tablib: https://tablib.readthedocs.io/en/stable/

Qem a year ago

boltons: https://boltons.readthedocs.io/en/latest/

Hasnep a year ago

pg8000 [1] is a Postgres library implemented in pure Python. I've spent enough time trying to get psycopg installed on MacOS and Docker that I appreciate just being able to pip install it at any time.

[1] https://github.com/tlocke/pg8000

enjoyyourlife a year ago

genson for creating JSON schemaa

pizza a year ago

- rich

- einops

- pytrees

- torchinfo

- joblib

- symbex

- nbdev

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection