LIVIVO - Search results -

Search results

Result 1 - 5 of total 5

Search options

Book ; Online: Towards Automated Circuit Discovery for Mechanistic Interpretability

Conmy, Arthur / Mavor-Parker, Augustine N. / Lynch, Aengus / Heimersheim, Stefan / Garriga-Alonso, Adrià

2023

Abstract: Through considerable effort and intuition, several recent works have reverse-engineered nontrivial behaviors of transformer models. This paper systematizes the mechanistic interpretability process they followed. First, researchers choose a metric and ... ...

Abstract	Through considerable effort and intuition, several recent works have reverse-engineered nontrivial behaviors of transformer models. This paper systematizes the mechanistic interpretability process they followed. First, researchers choose a metric and dataset that elicit the desired model behavior. Then, they apply activation patching to find which abstract neural network units are involved in the behavior. By varying the dataset, metric, and units under investigation, researchers can understand the functionality of each component. We automate one of the process' steps: to identify the circuit that implements the specified behavior in the model's computational graph. We propose several algorithms and reproduce previous interpretability results to validate them. For example, the ACDC algorithm rediscovered 5/5 of the component types in a circuit in GPT-2 Small that computes the Greater-Than operation. ACDC selected 68 of the 32,000 edges in GPT-2 Small, all of which were manually found by previous work. Our code is available at https://github.com/ArthurConmy/Automatic-Circuit-Discovery. Comment: NeurIPS 2023 Spotlight
Keywords	Computer Science - Machine Learning
Subject code	006
Publishing date	2023-04-28
Publishing country	us
Document type	Book ; Online
Database	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Full text

Inter-library loan at ZB MED

Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.

Book ; Online: Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNs

McEwen, Jason D. / Wallis, Christopher G. R. / Mavor-Parker, Augustine N.

2021

Abstract: Convolutional neural networks (CNNs) constructed natively on the sphere have been developed recently and shown to be highly effective for the analysis of spherical data. While an efficient framework has been formulated, spherical CNNs are nevertheless ... ...

Abstract	Convolutional neural networks (CNNs) constructed natively on the sphere have been developed recently and shown to be highly effective for the analysis of spherical data. While an efficient framework has been formulated, spherical CNNs are nevertheless highly computationally demanding; typically they cannot scale beyond spherical signals of thousands of pixels. We develop scattering networks constructed natively on the sphere that provide a powerful representational space for spherical data. Spherical scattering networks are computationally scalable and exhibit rotational equivariance, while their representational space is invariant to isometries and provides efficient and stable signal representations. By integrating scattering networks as an additional type of layer in the generalized spherical CNN framework, we show how they can be leveraged to scale spherical CNNs to the high-resolution data typical of many practical applications, with spherical signals of many tens of megapixels and beyond. Comment: 16 pages, 5 figures
Keywords	Computer Science - Computer Vision and Pattern Recognition ; Astrophysics - Instrumentation and Methods for Astrophysics ; Computer Science - Machine Learning ; Electrical Engineering and Systems Science - Image and Video Processing
Subject code	004 ; 006
Publishing date	2021-02-04
Publishing country	us
Document type	Book ; Online
Database	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Full text

Inter-library loan at ZB MED

Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.

Book ; Online: A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism

Mavor-Parker, Augustine N. / Sargent, Matthew J. / Banino, Andrea / Griffin, Lewis D. / Barry, Caswell

2022

Abstract: Animals are able to rapidly infer from limited experience when sets of state action pairs have equivalent reward and transition dynamics. On the other hand, modern reinforcement learning systems must painstakingly learn through trial and error that sets ... ...

Abstract	Animals are able to rapidly infer from limited experience when sets of state action pairs have equivalent reward and transition dynamics. On the other hand, modern reinforcement learning systems must painstakingly learn through trial and error that sets of state action pairs are value equivalent -- requiring an often prohibitively large amount of samples from their environment. MDP homomorphisms have been proposed that reduce the observed MDP of an environment to an abstract MDP, which can enable more sample efficient policy learning. Consequently, impressive improvements in sample efficiency have been achieved when a suitable MDP homomorphism can be constructed a priori -- usually by exploiting a practioner's knowledge of environment symmetries. We propose a novel approach to constructing a homomorphism in discrete action spaces, which uses a partial model of environment dynamics to infer which state action pairs lead to the same state -- reducing the size of the state-action space by a factor equal to the cardinality of the action space. We call this method equivalent effect abstraction. In a gridworld setting, we demonstrate empirically that equivalent effect abstraction can improve sample efficiency in a model-free setting and planning efficiency for modelbased approaches. Furthermore, we show on cartpole that our approach outperforms an existing method for learning homomorphisms, while using 33x less training data. Comment: Previously Presented at the Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM) 2022
Keywords	Computer Science - Machine Learning ; Computer Science - Artificial Intelligence
Subject code	006
Publishing date	2022-09-13
Publishing country	us
Document type	Book ; Online
Database	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Full text

Inter-library loan at ZB MED

Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.

Book ; Online: Escaping Stochastic Traps with Aleatoric Mapping Agents

Mavor-Parker, Augustine N. / Young, Kimberly A. / Barry, Caswell / Griffin, Lewis D.

2021

Abstract: Exploration in environments with sparse rewards is difficult for artificial agents. Curiosity driven learning -- using feed-forward prediction errors as intrinsic rewards -- has achieved some success in these scenarios, but fails when faced with action- ... ...

Abstract	Exploration in environments with sparse rewards is difficult for artificial agents. Curiosity driven learning -- using feed-forward prediction errors as intrinsic rewards -- has achieved some success in these scenarios, but fails when faced with action-dependent noise sources. We present aleatoric mapping agents (AMAs), a neuroscience inspired solution modeled on the cholinergic system of the mammalian brain. AMAs aim to explicitly ascertain which dynamics of the environment are unpredictable, regardless of whether those dynamics are induced by the actions of the agent. This is achieved by generating separate forward predictions for the mean and variance of future states and reducing intrinsic rewards for those transitions with high aleatoric variance. We show AMAs are able to effectively circumvent action-dependent stochastic traps that immobilise conventional curiosity driven agents. The code for all experiments presented in this paper is open sourced: http://github.com/self-supervisor/Escaping-Stochastic-Traps-With-Aleatoric-Mapping-Agents. Comment: Previously Presented at the NeurIPS (2020) Biological and Artificial Reinforcement Learning Workshop
Keywords	Computer Science - Machine Learning ; Computer Science - Artificial Intelligence
Subject code	006
Publishing date	2021-02-08
Publishing country	us
Document type	Book ; Online
Database	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Full text

Inter-library loan at ZB MED

Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.

Book ; Online: Efficient Generalized Spherical CNNs

Cobb, Oliver J. / Wallis, Christopher G. R. / Mavor-Parker, Augustine N. / Marignier, Augustin / Price, Matthew A. / d'Avezac, Mayeul / McEwen, Jason D.

2020

Abstract: Many problems across computer vision and the natural sciences require the analysis of spherical data, for which representations may be learned efficiently by encoding equivariance to rotational symmetries. We present a generalized spherical CNN framework ...

Abstract	Many problems across computer vision and the natural sciences require the analysis of spherical data, for which representations may be learned efficiently by encoding equivariance to rotational symmetries. We present a generalized spherical CNN framework that encompasses various existing approaches and allows them to be leveraged alongside each other. The only existing non-linear spherical CNN layer that is strictly equivariant has complexity $\mathcal{O}(C^2L^5)$, where $C$ is a measure of representational capacity and $L$ the spherical harmonic bandlimit. Such a high computational cost often prohibits the use of strictly equivariant spherical CNNs. We develop two new strictly equivariant layers with reduced complexity $\mathcal{O}(CL^4)$ and $\mathcal{O}(CL^3 \log L)$, making larger, more expressive models computationally feasible. Moreover, we adopt efficient sampling theory to achieve further computational savings. We show that these developments allow the construction of more expressive hybrid models that achieve state-of-the-art accuracy and parameter efficiency on spherical benchmark problems. Comment: 20 pages, 4 figures, accepted by ICLR, code at https://www.kagenova.com/products/fourpiAI/
Keywords	Computer Science - Computer Vision and Pattern Recognition ; Astrophysics - Instrumentation and Methods for Astrophysics ; Computer Science - Machine Learning
Subject code	004
Publishing date	2020-10-09
Publishing country	us
Document type	Book ; Online
Database	BASE - Bielefeld Academic Search Engine (life sciences selection)

Full text online

Full text

Inter-library loan at ZB MED

Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.

To top

Search results

Search options

Book ; Online: Towards Automated Circuit Discovery for Mechanistic Interpretability

Full text online

More links

Kategorien

Inter-library loan at ZB MED

Book ; Online: Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNs

Full text online

More links

Kategorien

Inter-library loan at ZB MED

Book ; Online: A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism

Full text online

More links

Kategorien

Inter-library loan at ZB MED

Book ; Online: Escaping Stochastic Traps with Aleatoric Mapping Agents

Full text online

More links

Kategorien

Inter-library loan at ZB MED

Book ; Online: Efficient Generalized Spherical CNNs

Full text online

More links

Kategorien

Inter-library loan at ZB MED