Computer Vision and AI
Written by Shyam Ashokan ..... 07 February 2025
In the world of machine learning, you often hear words like “neural networks” and “artificial intelligence” being thrown around – often in the mission to simulate a brain entirely from computers. But once that’s sorted, we’d ideally want to use it. Perhaps you might think the logical next step is simulating a pair of eyes to see with – that’s where Computer Vision comes in. Computer Vision is a field of computer science and that target the ability of machines to process, analyse and UNDERSTAND visual information from the real world – entirely by algorithms. This involves breaking down images and videos into structured data, and in the case for BeanSat, it’s vital to quantify our fungi growth.
But how does it really work? At its core, CV is a multi-step process that involves capturing, processing and interpreting images with a typical pipeline of:
- Image Acquisition – Acquiring the raw data from a camera source.
- Pre-processing – Applying algorithms or filters like noise reduction, resizing and colour correction to enhance the data and make it more easily visible.
- Feature Extraction – Identify key characteristics like contours, edges, textures and colours.
- Object Detection and Classification – Recognising and categorising features in the image using Machine Learning models.
- Post-Processing and Analysis – Using the results to make decisions or predictions.
This has been a pivotal breakthrough throughout artificial intelligence, and is ubiquitous throughout modern society. For example in the healthcare sector, CV-powered imaging systems have been used for diagnosing diseases like cancer, identify anomalies in X-rays and even assisting directly with surgeries since the semantic segmentation of the data is far more accurate and quicker than traditional diagnostic methods. In agriculture, CV systems are used to help monitor crop health, detect pests and optimise irrigation. Farmers often use drones or stationary cameras in their fields to maintain crop yields throughout the year and identify any parasites or diseases. And linking back to CanSat, CV has been essential for autonomous navigation and object tracking in extraterrestrial environments like on Mars or the Moon. Robotic systems like the Mars rovers use these technologies to analyse terrain, detect hazards and simply gather data to be processed for the future of science.
Clearly, Computer Vision is crucial for several real-world applications, but several challenges arise when implementing CV – alongside rather sophisticated solutions. One major challenge of computer vision is real-time processing and data transmission. Unlike most static datasets that can quickly be sent over to some processing unit, image and especially video data tends to be very large, containing the entire canvas size, individual pixel colour data and their associated metadata which can quickly lead to bandwidth issues and bottlenecks in analysis. To mitigate this, smart buffering techniques and periodic image updates – similar to that of NASA monitors and even our CanSat – are used. For larger scale projects, cloud-processing could even be a consideration to efficiently handle large datasets by partitioning the workload to several computers.
Another major challenge is the completely uncontrollable environmental variability. Lighting conditions, camera angles, minor distortions, glare from some light of another component: all of these are factors we had to face with designing the camera software as even minor distortions can impact accuracy immensely. This is particularly an issue for us due to the biological nature of our project, any condensation build-up in the petri dish or lighting that obscures the agar’s colour can immensely affect the machine learning models and our final data. To combat this, techniques like adaptive thresholding, histogram equalisation, machine learning-based segmentation and honestly just manually messing around with the colour data with some smart matrix transformations ensures consistency throughout images. Furthermore, fungal growth and structures are generally very irregular. This makes boundary detection and feature extraction incredibly difficult, but future improvements for our can to improve this may be to use infrared or multispectral imaging to capture the miniscule details of the growths, and using self-learning AI models to in fact predict the growth trends of the strains.
The combination of CV and AI in this project has demonstrated the further potential these technologies have to revolutionize biological research. By automating repetitive tasks, accuracy and unlocking new insights, it quite literally paves the way for advancements in agriculture, environmental science and space exploration. Perhaps one day it could even inform the development of life support systems in space-based living conditions and microgravity or other extreme conditions. As AI continues to evolve, more and more gateways into the natural world open - and its potential to unlock new frontiers of knowledge and capability will only grow.