Melchior on CLR

卷积神经网络

任何一个机器学习的任务都可以被拆解为三步

  1. 找到一组函数(model)
  2. 找到评价函数好坏的标准(loss)
  3. 找出最好的函数(optimization)

这三步在神经网络中同样成立。但在神经网络中,要找的不是一组函数,而是一种网络架构(architecture)。本文中的卷积神经网络(CNN)就是一种网络架构

神经网络中的Regularization

Deep learning algorithms are typically applied to extremely complicated domains where the true generation process essentially involves simulating the entire universe… Controlling the complexity of the model is not a simple matter of finding the model of the right size, with the right number of parameters. Instead, we might find—and indeed in practical deep learning scenarios, we almost always do find—that the best fitting model (in the sense of minimizing generalization error) is a large model that has been regularized appropriately