Problem in Video Primal Sketch
We list the experimental results in [2] as follows. As shown in Table.1, major model parameters come from representing explicit regions.
As shown in Table.4, major error of video primal sketch comes from reconstructing explicit regions (i.e. sparse coding residual error).
Note we have discussed in the modeling texton section, the authors develop a special dictionary for representing trackable and non-sketchable region. So here comes a problem, since modeling implicit regions cost both less parameters and error, why categorizes trackable and non-sketchable region into explicit region?
A Philosophy Problem
Probability model for the primal sketch representation:
Probability model for the video primal sketch representation:
Two above representations share a similar definition: a probability model with two inconsistent energy measurement (i.e. pixel-wise energy loss and statistics-wise energy loss). As a reuslt, solving above problems suffer from great computational complexity.
If we look at the above problem in a philosophical way, it is similiar with the debate about contrary vs. uniform (also stated as eternal debate by S.C. Zhu). Since the current research philosophy tends to be metaphysics, we usually treat image as separate regions without any connections. I mean, probably in the not far future, more effective representations of texture/texton should have consistent forms.
Vista
Is that even possible? I believe so. In quantum physics, particle wave duality explains the contrary and the uniform relationship. Analogized to this, texture&texton should coexist in the atom of image&video: pixels.
According to Schrödinger Equation, the particle position we observe is the integral of a probability wave. In my opinion, a new intuition of video parsing is: trackable and sketchable motion represented as the integral of a single probability distribution while textured motion as the integral of the composition of several probability wave.
Limited by current computer hardware, the above representation is definitely unsolvable. But I believe, one day, WE WILL REACH THE FINAL SOLUTION. ^_^