Aidan Gomez

Toronto, Ontario, Canada
23K followers · 500+ connections

About

I'm interested in making massive neural networks more efficient, and getting them…

Experience & Education

  • Cohere

Volunteer Experience

  • Volunteer

    Good Shepherd Ministries

    - 9 months

    Poverty Alleviation

    The Good Shepherd is a homeless shelter in Toronto where hundreds line up every day to receive breakfast, lunch, and dinner. I had the honour of serving these individuals breakfast and having conversations with my fellow Torontonians. The volunteers and staff I worked alongside are stunning examples of human empathy; I'm endlessly grateful for what The Good Shepherd has given me.

  • Journey of Hope

    Kawartha Pine Ridge District School Board

    - 6 months

    Children

    The Journey of Hope is a humanitarian KPRDSB initiative that sends students from three schools to Tanzania. Our roles there ranged from restoring educational infrastructure to teaching students computer skills. In addition, we donated over 1,200 pounds of supplies to various institutions across the country.

  • Volunteer

    The Companions of the Order of Malta, Oxford

    - Present 5 years 9 months

    Poverty Alleviation

    I've been incredibly fortunate to spend time having conversations with, and serving food and drink to, fellow Oxonians.

Publications

  • The Reversible Residual Network: Backpropagation Without Storing Activations

    Deep residual networks (ResNets) have significantly pushed forward the state-of-the-art on image classification, increasing in performance as networks grow both deeper and wider. However, memory consumption becomes a bottleneck, as one needs to store the activations in order to calculate gradients using backpropagation. We present the Reversible Residual Network (RevNet), a variant of ResNets where each layer's activations can be reconstructed exactly from the next layer's. Therefore, the activations for most layers need not be stored in memory during backpropagation. We demonstrate the effectiveness of RevNets on CIFAR-10, CIFAR-100, and ImageNet, establishing nearly identical classification accuracy to equally-sized ResNets, even though the activation storage requirements are independent of depth.

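    Not code from the paper, but a minimal numpy sketch of the reversibility idea the abstract describes, assuming toy stand-in residual functions F and G in place of real convolutional blocks:

      import numpy as np

      def F(x):
          # Stand-in residual function (a conv block in an actual RevNet).
          return np.tanh(x)

      def G(x):
          # Second stand-in residual function.
          return np.tanh(2 * x)

      def revnet_forward(x1, x2):
          # Coupled forward pass: y1 = x1 + F(x2), y2 = x2 + G(y1).
          y1 = x1 + F(x2)
          y2 = x2 + G(y1)
          return y1, y2

      def revnet_inverse(y1, y2):
          # The inputs are reconstructed exactly from the outputs, so most
          # activations need not be stored during backpropagation.
          x2 = y2 - G(y1)
          x1 = y1 - F(x2)
          return x1, x2

      x1, x2 = np.random.randn(4), np.random.randn(4)
      y1, y2 = revnet_forward(x1, x2)
      r1, r2 = revnet_inverse(y1, y2)
      assert np.allclose(x1, r1) and np.allclose(x2, r2)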
  • One Model To Learn Them All

    arXiv

    Deep learning yields great results across many fields, from speech recognition, image classification, to translation. But for each problem, getting a deep model to work well involves research into the architecture and a long period of tuning. We present a single model that yields good results on a number of problems spanning multiple domains. In particular, this single model is trained concurrently on ImageNet, multiple translation tasks, image captioning (COCO dataset), a speech recognition corpus, and an English parsing task. Our model architecture incorporates building blocks from multiple domains. It contains convolutional layers, an attention mechanism, and sparsely-gated layers. Each of these computational blocks is crucial for a subset of the tasks we train on. Interestingly, even if a block is not crucial for a task, we observe that adding it never hurts performance and in most cases improves it on all tasks. We also show that tasks with less data benefit largely from joint training with other tasks, while performance on large tasks degrades only slightly if at all.

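    As a hedged sketch (not the paper's code) of one building block the abstract mentions: a sparsely-gated layer routes each input to only its top-k experts; the expert and gate weights below are random placeholders.

      import numpy as np

      rng = np.random.default_rng(0)
      d, n_experts, k = 8, 4, 2

      W_gate = rng.standard_normal((d, n_experts))                        # gating weights (placeholder)
      experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]   # toy linear "experts"

      def sparsely_gated(x):
          # Score every expert, but keep only the k highest-scoring ones.
          scores = x @ W_gate
          top_k = np.argsort(scores)[-k:]
          gates = np.exp(scores[top_k])
          gates /= gates.sum()
          # Output is the gate-weighted sum of the selected experts only.
          return sum(g * (x @ experts[i]) for g, i in zip(gates, top_k))

      print(sparsely_gated(rng.standard_normal(d)).shape)   # (8,)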
  • Attention Is All You Need

    The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.0 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

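    As an illustration of the mechanism the abstract refers to (a minimal numpy sketch, not the authors' implementation), the Transformer's core operation is scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V:

      import numpy as np

      def scaled_dot_product_attention(Q, K, V):
          # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
          d_k = Q.shape[-1]
          scores = Q @ K.T / np.sqrt(d_k)
          weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
          weights /= weights.sum(axis=-1, keepdims=True)
          return weights @ V

      rng = np.random.default_rng(0)
      Q = rng.standard_normal((5, 64))   # 5 query positions, d_k = 64
      K = rng.standard_normal((7, 64))   # 7 key positions
      V = rng.standard_normal((7, 64))
      print(scaled_dot_product_attention(Q, K, V).shape)   # (5, 64)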
  • Depthwise Separable Convolutions for Neural Machine Translation

    Depthwise separable convolutions reduce the number of parameters and computation used in convolutional operations while increasing representational efficiency. They have been shown to be successful in image classification models, both in obtaining better models than previously possible for a given parameter count (the Xception architecture) and considerably reducing the number of parameters required to perform at a given level (the MobileNets family of architectures). Recently, convolutional sequence-to-sequence networks have been applied to machine translation tasks with good results. In this work, we study how depthwise separable convolutions can be applied to neural machine translation. We introduce a new architecture inspired by Xception and ByteNet, called SliceNet, which enables a significant reduction of the parameter count and amount of computation needed to obtain results like ByteNet, and, with a similar parameter count, achieves new state-of-the-art results. In addition to showing that depthwise separable convolutions perform well for machine translation, we investigate the architectural changes that they enable: we observe that thanks to depthwise separability, we can increase the length of convolution windows, removing the need for filter dilation. We also introduce a new "super-separable" convolution operation that further reduces the number of parameters and computational cost for obtaining state-of-the-art results.

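    To make the parameter saving concrete, here is a back-of-the-envelope comparison; the channel counts and kernel width are illustrative assumptions, not figures from the paper:

      # Parameters for one layer, biases ignored.
      c_in, c_out, k = 256, 256, 3    # assumed channel counts and 1-D kernel width

      standard = c_in * c_out * k     # standard convolution
      depthwise = c_in * k            # depthwise: one k-wide filter per input channel
      pointwise = c_in * c_out        # pointwise 1x1 convolution mixes channels
      separable = depthwise + pointwise

      print(standard, separable, round(standard / separable, 2))
      # 196608 66304 2.97 -> roughly a 3x reduction at these sizes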
  • Blog: The Neural Turing Machine

    A brief outline of the Neural Turing Machine's (NTM) design: an architecture, trainable end-to-end by backpropagation, that can (among many possibilities) learn to dynamically execute programs.

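    As a hedged sketch of one piece of the design the post outlines (my own toy example, not the post's): an NTM read head typically addresses memory by content, comparing a key against every memory row with cosine similarity and reading a softmax-weighted mixture, all of which is differentiable:

      import numpy as np

      def content_addressing(memory, key, beta=1.0):
          # Cosine similarity between the key and every memory row.
          sims = memory @ key / (np.linalg.norm(memory, axis=1) * np.linalg.norm(key) + 1e-8)
          # Sharpened softmax over the similarities gives the read weights.
          w = np.exp(beta * sims)
          return w / w.sum()

      rng = np.random.default_rng(0)
      memory = rng.standard_normal((128, 20))   # 128 memory slots of width 20
      key = rng.standard_normal(20)
      weights = content_addressing(memory, key, beta=5.0)
      read_vector = weights @ memory            # differentiable "read" from memory
      print(read_vector.shape)                  # (20,)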
  • Blog: Backpropogating an LSTM: A Numerical Example

    Medium

    LSTMs are arguably the most widely used recurrent neural network architecture. This article walks through the mathematics behind these versatile units.

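    In the same spirit as that walkthrough (a minimal sketch with made-up sizes, not the article's own example), the standard LSTM cell equations in numpy:

      import numpy as np

      def sigmoid(x):
          return 1.0 / (1.0 + np.exp(-x))

      def lstm_step(x, h_prev, c_prev, W, b):
          # Concatenate previous hidden state and input, then compute the four gates.
          z = W @ np.concatenate([h_prev, x]) + b
          i, f, o, g = np.split(z, 4)
          i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
          c = f * c_prev + i * g      # new cell state
          h = o * np.tanh(c)          # new hidden state
          return h, c

      rng = np.random.default_rng(0)
      n_in, n_hidden = 3, 4
      W = rng.standard_normal((4 * n_hidden, n_hidden + n_in))
      b = np.zeros(4 * n_hidden)
      h, c = np.zeros(n_hidden), np.zeros(n_hidden)
      h, c = lstm_step(rng.standard_normal(n_in), h, c, W, b)
      print(h.shape, c.shape)         # (4,) (4,)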
  • Blog: Facebook on the creation of Machine Intelligence

    Medium

    An exploration of the technologies and philosophy being used to craft the first generation of artificial intelligence.

Honors & Awards

  • AI Grant Fellow

    AI Grant

    aigrant.org - A fellowship sponsored by Google, CRV and others; started by Nat Friedman (Xamarin) and Daniel Gross (Y Combinator).

  • Clarendon Scholar

    clarendon.ox.ac.uk - Billed as Oxford’s most competitive graduate scholarship, the Clarendon is awarded solely on the basis of academic performance and contribution.

  • Open Philanthropy AI Fellow

    Open Philanthropy

  • University College Alumni Scholar

Languages

  • English

    Native or bilingual proficiency
