Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

New Solution to 3D Projection in Human-like Binocular Vision

Version 1 : Received: 8 October 2023 / Approved: 8 October 2023 / Online: 8 October 2023 (10:19:31 CEST)
Version 2 : Received: 22 February 2024 / Approved: 22 February 2024 / Online: 22 February 2024 (12:46:40 CET)

How to cite: Xie, M.; Fang, Y.; Lai, T. New Solution to 3D Projection in Human-like Binocular Vision. Preprints 2023, 2023100444. https://doi.org/10.20944/preprints202310.0444.v2 Xie, M.; Fang, Y.; Lai, T. New Solution to 3D Projection in Human-like Binocular Vision. Preprints 2023, 2023100444. https://doi.org/10.20944/preprints202310.0444.v2

Abstract

A human eye has about 120 million rod cells and 6 million cone cells. This huge number of light sensing cells inside a human eye will continuously produce a huge quantity of visual signals which flow into a human brain for daily processing. However, the real-time processing of these visual signals does not cause any fatigue to a human brain. This fact tells us the truth which is to say that human-like vision processes do not rely on complicated formulas to compute depth, displacement, and colors, etc. On the other hand, a human eye is like a PTZ camera. Here, PTZ stands for pan, tilt and zoom. We all know that in computer vision, each set of PTZ parameters (i.e., coefficients of pan, tilt and zoom) requires a dedicated calibration to determine a camera’s projection matrix. Since there is an infinite number of PTZ parameters which could be produced by a human eye, it is unlikely that a human brain stores an infinite number of calibration matrices for each human eye. Therefore, it is an interesting question for us to answer, which is to say whether simpler formulas of computing depth and displacement exist or not. Moreover, these formulas must be calibration friendly (i.e., easy process on the fly or on the go). In this paper, we disclose an important discovery of a new solution to 3D projection in a human-like binocular vision system. The purpose of doing 3D projection in binocular vision is to undertake forward and inverse transformations (or mappings) between coordinates in 2D digital images and coordinates in a 3D analogue scene. The formulas underlying the new solution are accurate, easily computable, easily tunable (i.e., to be calibrated on the fly or on the go) and could be easily implemented by a neural system (i.e., a network of neurons). Experimental results have validated the discovered formulas.

Keywords

Monocular Vision; Binocular Vision; Forward Projection; Inverse Projection; Displacement Projection

Subject

Computer Science and Mathematics, Computer Vision and Graphics

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.