Chapter 8
Fast-AI Fastbook chapter 8, Collaborative Filtering
This chapter will be all about so-called latent factors. Latent factors are underlying concepts in your data that are not up-front, but can be learned by association.
For example, on Netflix you may have watched lots of movies that are science fiction, full of action, and were made in the 1970s. Netflix may not know these particular properties of the films you have watched, but it will be able to see that other people that have watched the same movies that you watched also tended to watch other movies that are science fiction, full of action, and were made in the 1970s. In other words, to use this approach we don’t necessarily need to know anything about the movies, except who like to watch them.
Chapter 7
Fast-AI Fastbook chapter 7, Training a state-of-the-art model
This chapter introduces some more advanced techniques for training an image classification model. To demonstrate this, we will be using imagenette, a subset of imagenet of 10 distinctive categories.
Lets first create a simple model that will serve as our base-line:
dblock = DataBlock(blocks=(ImageBlock(), CategoryBlock()),
get_items=get_image_files,
get_y=parent_label,
item_tfms=Resize(460),
batch_tfms=aug_transforms(size=224, min_scale=0.75))
dls = dblock.dataloaders(path, bs=64)
Nothing new so far in the data set up.
Chapter 6
Fast-AI Fastbook chapter 6, Multi-Category and Regression
So up until now we have learned to do simple image recognition into a single category, and learned some ways to optimize our trainings and improve our models.
In this chapter we will look at two other types of Computer Vision problems: Multi-Category Classification and Regression. In the process we will learn more about output activations, and more types of loss functions.
Multi-Label Classification
This refers to the problem of identifying the categories of objects in images into more than 1 (or 0) categories. With a single-category classification, the model always outputs something, even if you feed it complete trash. That might not be what we want. On the other hand, an image may contain multiple objects, belonging to different categories, and we might want to know about all of them, not just the most prominent one.