Artificial Intelligence (AI) is the theory and development of computer systems able to perform tasks which normally can only be accomplished by human intelligence.
- Machine Learning (ML) focuses on the development of programs that can learn and change on their own when exposed to new data without needing to be explicitly programmed to do so.
- Deep learning (DL) is a subset of ML and, thanks to advances in computing power, has spurred huge developments in speech and image recognition. An article in MIT Technology Review explained DL this way, “Deep learning software attempts to mimic the activity in layers of neurons in the neocortex, the wrinkly 80 percent of the brain where thinking occurs. The software learns, in a very real sense, to recognize patterns in digital representations of sounds, images, and other data.”
- DL’s secret sauce is simple and very natural for humans, it learns by examples. In the same way that Siri or Alexa tries to recognize the intent of your words, DL algorithms, of which neural networks are the most popular type, recognize and classify content in video and, ultimately, make it searchable.
How Unstructured Video Data is transformed to Actionable Intelligence
Unstructured data refers to information that either does not have a pre-defined data model, or is not organized in a pre-defined manner. This results in irregularities and ambiguities that make it difficult to understand using traditional programs.
Video is full of unstructured data, which means it is typically unreadable by a machine. You can’t search it, compare it against other data, or react to it quickly in critical situations. Working with unstructured video data is like trying to read a book in a language you’ve never seen before. Tangibly, this means footage of a busy lobby or intersection is just that, an incredibly busy camera frame full of people, trucks, cars or vans all crossing the screen, each object registering as meaningful or as meaningless as the next.