Data Project Management, Code Review Pyramid, Apache Arrow
-
- Increase data accessibility
- Provide faster ROI on data
- Save time for the data team and data consumers
- Provide more precise insights
-
An impressively handy list of open-source startup alternatives to established SaaS products.
-
Snowflake acquired Streamlit for $800MM. That's a really good match.
-
I enjoyed this video of Awesome Pandas Tricks from Advent of Code, not so much for the specific tricks, but more for the illustration of an effective and productive programming style in python (via a notebook or repl).
-
Just plain text files, that's my go to.
-
Gunnar Morling wrote a nice short post with a pyramid illustrating what aspects of code reviews should get the most attention.
-
Related: Do you pull and run code as part of code review? Not an obvious answer, as discussed in the thread.
-
A long but handy article on which Python built-ins should you know about, most of which you already know but a few you may not!
-
One of our teammates pointed me to this important post on Python Dependency Confusion, a vulnerability based on infiltration of malicious code in a public repository like PyPi.
-
This post presents Apache Arrow as an alternative to Pandas dataframes, addressing what it calls Pandas "lousy memory management."
-
4 methodical steps to building effective (and useful) dashboards.