Computer Vision

On DINO, Self-Distillation With No Labels

July 13, 2021 by Davide Coccomini

It has been clear for some time that the Transformers had arrived in the field of computer vision to amaze, but hardly anyone could have imagined such astonishing results from a Vision Transformer in such a short time since their first application. In this article, we discuss one of the most interesting advances in the field of computer vision, DINO, announced a few days … [Read more...] about On DINO, Self-Distillation With No Labels

On Transformers, TimeSformers, And Attention

May 25, 2021 by Davide Coccomini

Transformers are a very powerful Deep Learning model that has been able to become a standard in many Natural Language Processing tasks and is poised to revolutionize the field of Computer Vision as well. It all began in 2017 when Google Brain published the paper destined to change everything, Attention Is All You Need [4]. Researchers apply this new architecture to … [Read more...] about On Transformers, TimeSformers, And Attention

New Ways To Leverage AI Image Synthesis For Marketing & Advertising Content

May 22, 2021 by Mariya Yao

This research summary is part of our AI for Marketing series which covers the latest AI & machine learning approaches to 5 aspects of marketing automation: AttributionOptimizationPersonalizationAnalyticsContent Generation: ImagesContent Generation: VideosContent Generation: Text After Generative Adversarial Networks (GANs) were introduced by Ian Goodfellow in 2014, a … [Read more...] about New Ways To Leverage AI Image Synthesis For Marketing & Advertising Content

Novel AI Methods For Video Generation for Marketing & Advertising

May 22, 2021 by Mariya Yao

This research summary is part of our AI for Marketing series which covers the latest AI & machine learning approaches to 5 aspects of marketing automation: AttributionOptimizationPersonalizationAnalyticsContent Generation: ImagesContent Generation: VideosContent Generation: Text AI algorithms can significantly increase the efficiency of the original content generation … [Read more...] about Novel AI Methods For Video Generation for Marketing & Advertising

Step-By-Step Implementation of GANs on Custom Image Data in PyTorch: Part 2

March 11, 2021 by Varshita Sher

In Part 1 on GANs, we started to build intuition regarding what GANs are, why we need them, and how the entire point behind training GANs is to create a generator model that knows how to convert a random noise vector into a (beautiful) almost real image. Since we have already discussed the pseudocode in great depth in Part 1, be sure to check that out as … [Read more...] about Step-By-Step Implementation of GANs on Custom Image Data in PyTorch: Part 2

How I Would Explain GANs From Scratch to a 5-Year Old: Part 1

March 11, 2021 by Varshita Sher

Note: Quite frankly, there are already a zillion articles out there explaining the intuition behind GANs. While I will briefly touch upon it, the rest of the article will be an absolute deep dive into the GAN architecture and mainly coding — but with a very very detailed explanation of the pseudocode (open-sourced as an example by PyTorch on Github). Why do I need … [Read more...] about How I Would Explain GANs From Scratch to a 5-Year Old: Part 1

Why Is Object Detection So Messy?

February 16, 2021 by Ygor Rebouças Serpa

Those working with Neural Networks know how complicated Object Detection techniques can be. It is no wonder there is no straight forward resource for training them. You are always required to convert your data to a COCO-like JSON or some other unwanted format. It is never a plug and play experience. Moreover, no diagram thoroughly explains Faster R-CNN or YOLO as there is for … [Read more...] about Why Is Object Detection So Messy?

Transformers in Computer Vision

January 28, 2021 by Cheng He

Transformer architecture has achieved state-of-the-art results in many NLP (Natural Language Processing) tasks. One of the main breakthroughs with the Transformer model could be the powerful GPT-3 released in the middle of the year, which has been awarded Best Paper at NeurIPS2020. In Computer Vision, CNNs have become the dominant models for vision tasks … [Read more...] about Transformers in Computer Vision

Deep-Learning Based Object Detection in Crowded Scenes

January 14, 2021 by Patrick Langechuan Liu

Object detection in crowded scenes is challenging. When objects gather, they tend to overlap largely with each other, leading to occlusions. Occlusion caused by objects of the same class is called intra-class occlusion, also referred to as crowd occlusion. Object detectors need to determine the locations of different objects in the crowd and accurately delineate their … [Read more...] about Deep-Learning Based Object Detection in Crowded Scenes

NeurIPS 2020 Papers: Takeaways for a Deep Learning Engineer – Computer Vision

January 7, 2021 by Prabhu Prakash Kagitha

As mentioned in part 1– the most important thing:) – I went through all the titles of NeurIPS 2020 papers (more than 1900!) and read abstracts of 175 papers, and extracted DL engineer relevant insights from the following papers. This is part 2. See part 1 here. If this in-depth educational content is useful for you, you can subscribe to our AI research mailing list to be … [Read more...] about NeurIPS 2020 Papers: Takeaways for a Deep Learning Engineer – Computer Vision

« Previous Page

On DINO, Self-Distillation With No Labels

On Transformers, TimeSformers, And Attention

New Ways To Leverage AI Image Synthesis For Marketing & Advertising Content

Novel AI Methods For Video Generation for Marketing & Advertising

Step-By-Step Implementation of GANs on Custom Image Data in PyTorch: Part 2

How I Would Explain GANs From Scratch to a 5-Year Old: Part 1

Why Is Object Detection So Messy?

Transformers in Computer Vision

Deep-Learning Based Object Detection in Crowded Scenes

NeurIPS 2020 Papers: Takeaways for a Deep Learning Engineer – Computer Vision

Bots

Brands

Business

China

Commerce

Computer Vision

Conversational AI

Customer Service

Cybersecurity

Data Science & Engineering

Design

Education

Ethics & Safety

Finance

Gaming

Healthcare

HR & Recruiting

Infrastructure

Leadership & Management

Manufacturing

Marketing

Natural Language Processing

Reinforcement Learning

Research

Retail & CPG

Society

Technical Guide

Technology

About TOPBOTS

Computer Vision

Footer

About TOPBOTS