Eoin Farrell

Eoin FarrellAstrophysics PhD & AI Researcherhttps://efarrell1.github.io/Impact of binary interaction on the evolution of blue supergiantshttps://efarrell1.github.io/blog/bsg_binaries/https://efarrell1.github.io/blog/bsg_binaries/Published in Farrell et al. (2019), Astronomy & Astrophysics, Volume 621Tue, 01 Jan 2019 00:00:00 GMTastrophysicsNumerical experiments to help understand cause and effect in massive star evolutionhttps://efarrell1.github.io/blog/cause_and_effect_massive_stars/https://efarrell1.github.io/blog/cause_and_effect_massive_stars/Published in Farrell et al. (2022), Monthly Notices of the Royal Astronomical Society, Volume 512, Issue 3Sun, 01 May 2022 00:00:00 GMTastrophysicsInduction head circuits for longer sequenceshttps://efarrell1.github.io/blog/generalised_induction/https://efarrell1.github.io/blog/generalised_induction/In a 2-layer attention-only transformer model, an induction head can combine with an "averaging" head that stores some kind of average over the previous ~4-5 tokens to produce a circuit that can predict the next token in repeated sequences of length 2 to 5 .Tue, 03 Oct 2023 00:00:00 GMTaiIs GW190521 the merger of black holes from the first stellar generations? https://efarrell1.github.io/blog/gw190521/https://efarrell1.github.io/blog/gw190521/Published in Farrell et al. (2021), Monthly Notices of the Royal Astronomical Society: Letters, Volume 502, Issue 1Mon, 01 Mar 2021 00:00:00 GMTastrophysicsThe Initial Magnetic Field Distribution in AB Starshttps://efarrell1.github.io/blog/ifd_ab_stars/https://efarrell1.github.io/blog/ifd_ab_stars/Published in Farrell et al. (2022), The Astrophysical Journal, Volume 938, Issue 1Sat, 01 Oct 2022 00:00:00 GMTastrophysicsExperiments with an alternative method to promote sparsity in sparse autoencodershttps://efarrell1.github.io/blog/l0_loss_function/https://efarrell1.github.io/blog/l0_loss_function/I experimented with alternatives to the standard L1 penalty used to promote sparsity in sparse autoencoders (SAEs). I found that including terms based on an alternative differentiable approximation of the feature sparsity in the loss function was an effective way to generate sparsity in SAEs trained on the residual stream of GPT2-small.Tue, 16 Apr 2024 00:00:00 GMTaiA method for non-linear inversion of the stellar structure applied to gravity-mode pulsatorshttps://efarrell1.github.io/blog/non_linear_stellar_inversions/https://efarrell1.github.io/blog/non_linear_stellar_inversions/Published in Farrell et al. (2024) Astronomy & Astrophysics, Volume 686Thu, 18 Apr 2024 00:00:00 GMTastrophysicsPositional Embeddings in a 2-layer attention-only transformer modelhttps://efarrell1.github.io/blog/pos-embeddings-two_layer/https://efarrell1.github.io/blog/pos-embeddings-two_layer/This post contains some visualisations and discussion of positional embeddings. The position embeddings in a 2-layer attention-only transformer model arrange themselves into a helical structure. This presumably allows the model to generate QK matrices to move a few positions in relative terms with a similar transformation for all positions. The positional embeddings at positions 0 and 1023 have special properties.Sun, 01 Oct 2023 00:00:00 GMTaiThe previous token head and the "look-back-two" head https://efarrell1.github.io/blog/previous-token-head/https://efarrell1.github.io/blog/previous-token-head/A few plots on previous tokens heads, a discussion of how they work and a comparison to a similar type of attention head -- a "look-back-two" head.Mon, 02 Oct 2023 00:00:00 GMTai'Recency bias' in an induction headhttps://efarrell1.github.io/blog/recency-bias-induction-head/https://efarrell1.github.io/blog/recency-bias-induction-head/The induction head in a 2-layer attention-only transformer model has a slight bias towards tokens later in the context compared to earlier. Interestingly, its notion of position appears to not depend on positional embeddings, or any specific output from an attention head in the previous layer.Fri, 06 Oct 2023 00:00:00 GMTaiThe uncertain masses of progenitors of core-collapse supernovae and direct-collapse black holeshttps://efarrell1.github.io/blog/rsg_masses/https://efarrell1.github.io/blog/rsg_masses/Published in Farrell et al. (2020), Monthly Notices of the Royal Astronomical Society, Volume 494, Issue 1Fri, 01 May 2020 00:00:00 GMTastrophysicsExperiments with Sparse Autoencoders on Attention Headshttps://efarrell1.github.io/blog/sae_attention_heads/https://efarrell1.github.io/blog/sae_attention_heads/I trained sparse autoencoders on the key and query vectors of previous token heads and induction heads of attn-only-2l and gpt2-small and found interpretable features which I could intervene on in a predictable and interpretable way.Thu, 21 Dec 2023 00:00:00 GMTaiSNAPSHOT: connections between internal and surface properties of massive starshttps://efarrell1.github.io/blog/snapshot/https://efarrell1.github.io/blog/snapshot/Published in Farrell et al. (2020), Monthly Notices of the Royal Astronomical Society Volume 495, Issue 4 Wed, 01 Jul 2020 00:00:00 GMTastrophysicsUnlearning with sparse autoencodershttps://efarrell1.github.io/blog/unlearning_harry_potter/https://efarrell1.github.io/blog/unlearning_harry_potter/We trained sparse autoencoders on the open-source language model Pythia-2.8b to use for unlearning Harry Potter related knowledge. We can successfully unlearn significant levels of Harry Potter related knowledge with little to no side effects. This technique is worth exploring further.Mon, 03 Jun 2024 00:00:00 GMTai