Settings

Theme

Beyond the pixel plane: sensing and learning in 3D

thegradient.pub

129 points by gtmtg 8 years ago · 15 comments

Reader

JeremyHerrman 8 years ago

This is an exciting time for those of us working on computational geometry to better understand 3D shapes across many industries.

In addition to the architectures mentioned in this great overview, I'm excited to see progress on spectral and geodesic CNNs for graphs and manifolds. Check out this other fantastic source for info on 3D ML: http://geometricdeeplearning.com

jeffchuber 8 years ago

This is a great overview. Also checkout CS 468 from Stanford, http://graphics.stanford.edu/courses/cs468-17-spring/ "Machine Learning for 3D Data"

Also, if you want to work on this stuff full time- https://news.ycombinator.com/item?id=17649726

  • bitL 8 years ago

    Super cool course! Thanks for the link!

    Do you know what are the most precise programmable RGB-D cameras a non-professional can buy? I was trying to extract 3D information just from a single camera via 3D convolutions and RNNs (for a self-driving car project) and would like to play with real 3D a bit as well.

    • gtmtgOP 8 years ago

      (I wrote the original article)

      I've been playing around with a few and I'd recommend the Orbbec Astra and the Intel RealSense (the new D435 is what I've been using) as decent but cheap cameras if you want to get started! The Asus Xtion PRO LIVE is also quite good but since it's been discontinued it's pretty hard to find.

      The Stereolabs ZED relies on stereo vision but produces a similar output as traditional RGB-D cameras, and I've heard good things about it as well!

    • jeffchuber 8 years ago

      Not workable for a self-driving car - but for other applications the iPhone X has a front-facing RGB-D sensor.

    • a_t48 8 years ago

      As an alternative, dumping out pixel ground truth from something like Unreal Engine isn’t hard to do.

glalonde 8 years ago

What about pose estimation? e.g. Given a well defined coordinate system, like the origin is the nose on a face, determine the pose of the face. Is this still best done with classic optimization formulations like ransac/ICP and a supplied model, or have these been bested by learned models somehow?

ajmarcic 8 years ago

Has any progress been made towards single view 2D -> 3D inference?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection