1 20-ImageBasics

20-ImageBasics/image.png
While the difficulty of this task has become seemingly easier recently,
the comic still illustrates a useful point.

1.1 Types of image

1.1.1 Vector vs. Raster

1.1.2 Raster

20-ImageBasics/raster_dataset.png
NOTE: (row, col) not (x, y) !!

1.2 Color encoding

20-ImageBasics/rgb.png
20-ImageBasics/368px-Rgb-raster-image.png

1.3 Raster encoding

https://www.youtube.com/watch?v=15aqFQQVBWU

1.4 Indexed format

Raster image,
sometimes called bitmap,
which is really a pixmap format…
20-ImageBasics/BitmapFormat.jpg

Indexed image:
20-ImageBasics/IndexedFormat.jpg

1.5 PNG

https://en.wikipedia.org/wiki/Portable_Network_Graphics
https://www.w3.org/TR/2003/REC-PNG-20031110/index-noobject.html

This image has a transparent background specified by alpha layer:
20-ImageBasics/PNG_transparency_demonstration_1.png
See the background?
Change the color of the background.

1 pixel is:
4-tuple, each of up to 2⁸=256 per element,
with a total of 16,777,216 colors,
plus alpha layer.
20-ImageBasics/fig42.png

PNG has an indexed option, but we won’t focus on it:
20-ImageBasics/index_png.png

How do we select the cells in the image below?
Check out GIMP layers and thresholds (for segmenting)
Show this happening in GIMP on a simple cell image:
https://gitlab.com/bio-data/computer-vision/bio_images/

20-ImageBasics/dna.jpg
GIMP -> Colors -> Threshold

We will show how to do this in code upcoming:
https://en.wikipedia.org/wiki/Otsu's_method

What about this one?
20-ImageBasics/color.jpg
How do we fix the problem?
Does this do it?
GIMP -> Colors -> Desaturate

1.6 Image processing in python

General recommendation:
Do not start with a vision API like OpenCV,
which is high-level, magic,,
but instead, start simple!

Start with numpy, ndimage, and skimage,
and do what you can manually with simple np.arrays,
and the functions in ndimage and skimage.
If that is not enough,
then check out the capabilities of sklearn.
Finally if that’s still not enough,
then you’re ready to dive into OpenCV,
which is has the riches set of pure-vision options,
or do it all from scratch.

1.6.1 The image processing stack in python

1.6.1.1 1. NumPy (numpy)

https://en.wikipedia.org/wiki/NumPy
https://numpy.org/doc/stable/user/
https://numpy.org/learn/
Python NumPy provides support for large, multi-dimensional arrays.
Using NumPy, we can express images as multi-dimensional arrays.

1.6.1.2 2. Scipy (scipy.ndimage)

https://en.wikipedia.org/wiki/SciPy
https://docs.scipy.org/doc/scipy/tutorial/
The ndimage performs image operations at a mid-level of abstraction.
Image processing and analysis operates on two-dimensional arrays of values.
Images of higher dimensionality must also often be analyzed.
Good examples of these are medical imaging and biological imaging.
Python’s numpy is suited very well for this type of applications,
due its inherent multidimensional nature.
The scipy.ndimage package provides a general image processing and analysis functions,
that are designed to operate with arrays of arbitrary dimensionality.
The packages currently includes functions for:
linear and non-linear filtering, binary morphology,
B-spline interpolation, and object measurements.

1.6.1.3 3. Scikit-image (skimage)

https://en.wikipedia.org/wiki/Scikit-image
https://scikit-image.org/docs/stable/user_guide.html
https://scikit-image.org/docs/stable/auto_examples/index.html
The scikit-image SciKit (toolkit for SciPy) extends scipy.ndimage,
to provide a versatile set of image processing routines.
Scikit-image is a collection of algorithms for image processing.
It includes algorithms for:
segmentation, geometric transformations, color space manipulation,
analysis, filtering, morphology, feature detection, and more.
It is designed to interoperate with NumPy and SciPy.

1.6.1.4 4. Scikit-learn (sklearn)

Scikit-learn is a general machine learning library,
with many applications in vision and bioimage informatics
https://scikit-learn.org/stable/auto_examples/
Review vision examples at the above link,
particularly those related to segmentation and labeling.

1.6.2 If you want to take the next step

OpenCV (Open Source Computer Vision Library)
https://en.wikipedia.org/wiki/OpenCV
https://opencv.org/
https://docs.opencv.org/
The library has more than 2500 optimized algorithms,
which includes a comprehensive set of both classic,
and state-of-the-art computer vision and machine learning algorithms.
These algorithms can be used to detect and recognize faces,
identify objects, classify human actions in videos, track camera movements,
track moving objects, extract 3D models of objects and much more.
If you master the basics, and really want to get into vision,
then check this out further.

1.6.3 Honorable mention

Mahotas
https://mahotas.readthedocs.io
Bioimage informatics specific
Originally developed for biological image processing.
Fairly limited in scope, and mostly redundant with skimage.

PILLOW (old version was PIL)
https://python-pillow.org/
https://pillow.readthedocs.io
The Python Imaging Library or PIL allows you to do image processing in Python.
This might be for basic batch stuff, cut, paste, crop, clean,
for managing personal or website pictures, less for scientific purposes.

SimpleCV:
http://simplecv.org/
http://tutorial.simplecv.org/en/latest/
The goal of SimpleCV is to make it easier to do basic image processing and computer vision.
The learning curve is substantially smaller than that of OpenCV,
and as their tag-line says, “it’s computer vision made easy”.
More about the basic image handling, than scientific computing.

1.7 skimage format conventions

1.7.1 Coordinate conventions

Because we represent images with numpy arrays,
our coordinates must match accordingly.
Two-dimensional (2D) grayscale images are indexed by row and columns,
(abbreviated to either (row, col) or (r, c)),
with the lowest element (0, 0), at the top-left corner.
You will also see rr and cc refer to lists of row and column coordinates.
We contrast this against (x, y),
which commonly denote standard Cartesian coordinates,
where x is the horizontal coordinate, y the vertical,
and the origin is on the bottom left.
Matplotlib, for example, uses this convention.
In the case of color (or multi-channel) images,
the last dimension contains the color information,
and is denoted channel or ch.
For 3D images, such as videos,
magnetic resonance imaging (MRI) scans, or confocal microscopy,
we refer to the leading dimension as a plane,
abbreviated as pln or p:

1.7.1.1 Dimension name and order conventions in scikit-image

2D grayscale (row, col)

2D multichannel (eg. RGB) (row, col, ch)

3D grayscale (pln, row, col)

3D multichannel (pln, row, col, ch)

+++++++++++ Cahoot-20-1

1.7.2 Image data types and what they mean

In skimage, images are simply numpy arrays,
which support a variety of data types, i.e. “dtypes”.
To avoid distorting image intensities (see Re-scaling intensity values),
we assume that images use the following dtype ranges:

Data type Range
---------------------
uint8 0 to 255
uint16 0 to 65535
uint32 0 to 2³²
float -1 to 1 or 0 to 1
int8 -128 to 127
int16 -32768 to 32767
int32 -2³¹ to 2³¹ - 1

1.7.3 Input types

Although we aim to preserve the data range and type of input images,
functions may support only a subset of these data-types.
In such a case, the input will be converted to the required type (if possible),
and a warning message printed to the log if a memory copy is needed.
Type requirements should be noted in the doc-strings.
The following utility functions are available to developers and users:

Function name Description
---------------------------
img_as_float Convert to 64-bit floating point.
img_as_ubyte Convert to 8-bit uint.
img_as_uint Convert to 16-bit uint.
img_as_int Convert to 16-bit int.