Biography

Kale-ab is a PhD candidate in the Autonomous Agents research group at the University of Edinburgh, supervised by Dr. Stefano Albrecht. His work aims to make Multi-Agent Reinforcement Learning (MARL) algorithms more robust and reliable for real-world collaboration.

Before his PhD, he gained 4.5 years of experience in machine learning, including 2.5 years as a research engineer at InstaDeep, and 3 years of experience in software engineering. Kale-ab is also committed to supporting impactful tech projects in Africa and promoting diversity within the machine learning community.

For more information, you can view my resumé .

Interests
  • Multi-Agent Reinforcement Learning (MARL)
  • Reinforcement Learning
  • Deep Learning
Education
  • MSc in Computer Science, Focusing on Deep Learning (Distinction), 2018 - March, 2021

    University of Witwatersrand

  • Honours in Computer Science (Distinction), 2016

    University of Pretoria

  • BSc. in Computer Science, 2013 - 2015

    University of Pretoria

Recent Publications

(2024). Efficiently Quantifying Individual Agent Importance in Cooperative MARL. (Oral) eXplainable AI approaches for deep reinforcement learning (XAI4DRL) Workshop @ AAAI, 2024.

PDF

(2024). How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning. eXplainable AI approaches for deep reinforcement learning (XAI4DRL) Workshop @ AAAI, 2024.

PDF

(2023). Generalisable Agents for Neural Network Optimisation. WANT@ NeurIPS 2023 and OPT NeurIPS 2023 workshops.

PDF Poster

(2023). Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A. Deep Generative Models for Health Workshop NeurIPS 2023.

PDF

(2023). Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning. (Oral) Workshop on Reincarnating Reinforcement Learning at ICLR 2023.

PDF

(2022). Just-in-Time Sparsity: Learning Dynamic Sparsity Schedules. Dynamic Neural Networks ICML Workshop 2022.

PDF Poster Slides

(2021). On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa. AI + HADR 2021 and ML4D 2021 NeurIPS Workshops.

PDF Cite Code Blog

(2021). Mava: a research framework for distributed multi-agent reinforcement learning.

PDF Cite Code Blog

(2021). Keep the Gradients Flowing: Using Gradient Flow to Study Sparse Network Optimization. Sparsity in Neural Networks Workshop 2021.

PDF Cite Poster Slides

Open Source

You can see a full list of my open source projects on github - https://github.com/KaleabTessera .

Deep Learning Indaba Practicals 2022
A collection of high-quality practicals covering a variety of modern machine learning techniques.
Deep Learning Indaba Practicals 2022
Pseudo Absence Generation and Locust Prediction
Locust breeding ground prediction using pseudo-absence generation and machine learning.
Pseudo Absence Generation and Locust Prediction
Baobab
Baobab is an open source multi-tenant web application designed to facilitate the application and selection process for large scale meetings within the machine learning and artificial intelligence communities globally.
Baobab
DQN Agent playing Pong
A DQN agent playing pong.
DQN Agent playing Pong
Robotics Navigation
A project that compared the navigation ability of two robots (a Turtlebot and a Kuri robot) in challenging dynamic and static environments.
Robotics Navigation
PRM Path Planning
Implementation of Probabilistic Roadmaps – a path planning algorithm used in Robotics.
PRM Path Planning