Recent AI research has pointed out the synergies between touch and vision. One enables the measurement of 3D surface and inertial properties, while the other supplies a holistic view of an object's projected appearance. Building on this work, researchers at Samsung, McGill University, and York University investigated whether an AI system could predict the motion of an object from visual and tactile measurements of its initial state.
“Previous research has shown that it is challenging to predict the trajectory of objects in motion, due to the unknown frictional and geometric properties and indeterminate pressure distributions at the interacting surface,” the researchers wrote in a paper describing their work. “To alleviate these difficulties, we focus on learning a predictor trained to capture the most informative and stable elements of a motion trajectory.”
The researchers created a sensor, called See-Through-your-Skin, that they claim can capture images while providing detailed tactile measurements. Alongside this, they designed a framework named Generative Multimodal Perception that exploits visual and tactile data when available to learn a representation encoding information about object pose, shape, and force, and to make predictions about object dynamics. To anticipate the resting state of an object during physical interactions, they used what they call resting state predictions along with a visuotactile dataset of motions in dynamic scenes, including objects freefalling onto a flat surface, sliding down an inclined plane, and being perturbed from their resting pose.
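The paper does not spell out the architecture in this article, but a minimal sketch of the general idea of fusing the two modalities into a shared representation might look like the following. The model name, layer sizes, input resolutions, and averaging-based fusion are illustrative assumptions, not the authors' actual design.

```python
# Minimal PyTorch sketch (illustrative only): separate encoders for visual and
# tactile inputs feed a shared latent code, and a decoder predicts the
# resting-state observation from that code.
import torch
import torch.nn as nn

class VisuoTactilePredictor(nn.Module):
    def __init__(self, latent_dim=128):
        super().__init__()
        # Visual encoder: small CNN over the initial-state image (assumed 3x32x32).
        self.vision_enc = nn.Sequential(
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, latent_dim),
        )
        # Tactile encoder: plain MLP over a flattened tactile map (assumed 32x32).
        self.tactile_enc = nn.Sequential(
            nn.Flatten(), nn.Linear(32 * 32, 256), nn.ReLU(),
            nn.Linear(256, latent_dim),
        )
        # Decoder: maps the fused code to a predicted resting-state image.
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 256), nn.ReLU(),
            nn.Linear(256, 3 * 32 * 32),
        )

    def forward(self, image=None, tactile=None):
        # Fuse whichever modalities are present; a missing modality simply
        # contributes nothing to the shared code.
        codes = []
        if image is not None:
            codes.append(self.vision_enc(image))
        if tactile is not None:
            codes.append(self.tactile_enc(tactile))
        z = torch.stack(codes).mean(dim=0)
        return self.decoder(z).view(-1, 3, 32, 32)
```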
In experiments, the researchers say their approach was able to predict the raw visual and tactile measurements of an object's resting configuration with high accuracy, with the predictions closely matching the ground truth labels. Moreover, they claim their framework learned a mapping between the visual, tactile, and 3D pose modalities such that it could handle missing modalities, such as when tactile information was unavailable in the input, as well as predict cases where an object had fallen off the surface of the sensor, resulting in empty output images.
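As a toy illustration of the missing-modality case, the hypothetical model sketched above can be queried with the tactile input omitted; again, this is an assumption about how such an interface could look, not the authors' code.

```python
# Hypothetical usage: predict the resting-state image from vision alone,
# with the tactile modality absent from the input.
model = VisuoTactilePredictor()
initial_image = torch.randn(1, 3, 32, 32)                   # visual observation of the initial state
predicted_rest = model(image=initial_image, tactile=None)   # tactile input missing
print(predicted_rest.shape)                                  # torch.Size([1, 3, 32, 32])
```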
“If a previously unseen object is dropped into a human’s hand, we are able to infer the object’s category and guess at some of its physical properties, but the most immediate inference is whether it will come to rest safely in our palm, or if we need to adjust our grasp on the object to maintain contact,” the coauthors wrote. “[In our work,] we find that predicting object motions in physical scenarios benefits from exploiting both modalities: visual information captures object properties such as 3D shape and location, while tactile information provides critical cues about interaction forces and resulting object motion and contacts.”