I'm moving in the opposite direction: Altair -> Plotly. I find Altair to be too "grammar of graphics" for its own good, and the Vega backend makes it hard to hack around. Saving to PDF or other high-quality output also takes extra steps and additional dependencies.
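For anyone curious what those extra steps look like, a minimal sketch (assuming Altair 5, where static export goes through the separately installed vl-convert-python package):

```python
import altair as alt
import pandas as pd

# Toy chart just to demonstrate export.
df = pd.DataFrame({"x": range(10), "y": [v ** 2 for v in range(10)]})
chart = alt.Chart(df).mark_line().encode(x="x", y="y")

# Static export needs an extra dependency: `pip install vl-convert-python`.
chart.save("chart.pdf")                    # vector PDF
chart.save("chart.png", scale_factor=2.0)  # higher-resolution raster (recent Altair versions)
```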
Highly recommend playing Pentiment. Good mystery adventure set against the backdrop of medieval Bavaria. A large part of the story happens in a monastery (and the surrounding town).
I also see a use case for reading long fiction books. Recently I've been reading Stormlight Archives (Book 3) on and off, and there are times when I want to remember why character X did Y or what that epigraph means, etc.
I'm looking at the Fediverse explorer and I'm curious how to interpret the embeddings (or the distances between points). What does it mean when two instances are near each other (e.g., sigmoid.social and mastodon-belgium.be)? Is it related to the number of follows?
Currently I only use the peering information of instances. Instances are placed close to each other when they are peering.
As a next step I want to refine this, since peering by itself does not say anything about how much the users of these instances interact. For this I will have to sample the users of instances and who they follow.
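Not the explorer's actual code, but a minimal sketch of how peering data could be turned into a 2D layout where peered instances end up near each other (the instance list and the networkx spring layout are just illustrative assumptions):

```python
import networkx as nx

# Hypothetical peering pairs: each edge means the two instances federate.
peering = [
    ("sigmoid.social", "mastodon-belgium.be"),
    ("sigmoid.social", "mastodon.social"),
    ("mastodon-belgium.be", "mastodon.social"),
    ("example.town", "mastodon.social"),
]

G = nx.Graph()
G.add_edges_from(peering)

# A force-directed layout pulls connected nodes together, so instances that
# peer with the same neighbourhood end up close in the 2D map.
positions = nx.spring_layout(G, seed=42)
for instance, (x, y) in positions.items():
    print(f"{instance}: ({x:.2f}, {y:.2f})")
```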
> The gap between tree-based models and deep learning becomes narrower as the dataset size increases (here: 10k -> 50k).
I am curious whether there is a sample-size threshold beyond which it's worth exploring deep learning approaches to tabular data. I also wonder whether there are other considerations (e.g., inference speed, explainability, etc.).
>if there is a sample threshold where it's worth exploring deep learning
Not especially, but there are tasks where DL models seem to occasionally outperform by a little. If you really want to milk extra accuracy, it can be worth trying a DL model; if it performs as well or better, you can ensemble it with your GBM or replace the GBM, though it's rarely worth it. If you check the writeups of Kaggle winners on tabular data, most use GBMs, or an ensemble for a tiny boost over a GBM alone.
Assuming limited time to work on the problem, you'd almost always want to focus on further feature engineering first and likely some hyperparameter tuning second.
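To make the ensembling idea above concrete, here's a rough sketch using scikit-learn, with HistGradientBoostingClassifier standing in for the GBM and a small MLP standing in for the DL model (synthetic data, so the numbers don't mean anything):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import HistGradientBoostingClassifier, VotingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=10_000, n_features=30, random_state=0)

gbm = HistGradientBoostingClassifier(random_state=0)
mlp = make_pipeline(
    StandardScaler(),
    MLPClassifier(hidden_layer_sizes=(64, 64), max_iter=500, random_state=0),
)

# Soft voting averages the predicted probabilities of both models; only keep
# the ensemble if it actually beats the GBM alone on cross-validation.
ensemble = VotingClassifier([("gbm", gbm), ("mlp", mlp)], voting="soft")

for name, model in [("gbm alone", gbm), ("gbm + mlp", ensemble)]:
    score = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name}: {score:.3f}")
```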
I worked on a little side project for classification on tabular data, with a really challenging use case where the data was prone to a lot of noise and there was some randomness in the dependent variable. Tree models couldn't get a high enough accuracy, and when the dataset had fewer than roughly 6k entries, deep learning performed even worse (as expected).
What was really interesting was when the dataset grew past 6k or so entries: the deep learning model was suddenly much more accurate, by a wide margin. At roughly the 10k mark, the DL model was easily outperforming the tree model.
It depends on the "DL model", which is a highly vague term. Both a model with 10K parameters and a model with 10T parameters fit this description equally well.