LaSeSOM: A Latent and Semantic Representation Framework for Soft Object Manipulation

Peng Zhou; Jihong Zhu; Shengzeng Huo; David Navarro-Alarcon

doi:10.1109/LRA.2021.3074872

LaSeSOM: A Latent and Semantic Representation Framework for Soft Object Manipulation

Peng Zhou, Jihong Zhu, Shengzeng Huo, David Navarro-Alarcon

Learning & Autonomous Control

Research output: Contribution to journal › Article › Scientific › peer-review

11 Citations (Scopus)

26 Downloads (Pure)

Abstract

Soft object manipulation has recently gained popularity within the robotics community due to its potential applications in many economically important areas. Although great progress has been recently achieved in these types of tasks, most state-of-the-art methods are case-specific; They can only be used to perform a single deformation task (e.g. bending), as their shape representation algorithms typically rely on "hard-coded" features. In this paper, we present LaSeSOM, a new feedback latent representation framework for semantic soft object manipulation. Our new method introduces internal latent representation layers between low-level geometric feature extraction and high-level semantic shape analysis; This allows the identification of each compressed semantic function and the formation of a valid shape classifier from different feature extraction levels. The proposed latent framework makes soft object representation more generic (independent from the object's geometry and its mechanical properties) and scalable (it can work with 1D/2D/3D tasks). Its high-level semantic layer enables to perform (quasi) shape planning tasks with soft objects, a valuable and underexplored capability in many soft manipulation tasks. To validate this new methodology, we report a detailed experimental study with robotic manipulators.

Original language	English
Pages (from-to)	5381-5388
Journal	IEEE Robotics and Automation Letters
Volume	6
Issue number	3
DOIs	https://doi.org/10.1109/LRA.2021.3074872
Publication status	Published - 2021

Bibliographical note

Accepted Author Manuscript

Keywords

Bimanual Manipulation
Geodesic Interpolation
Latent Space and Manifolds
Manifolds
Planning
Representation Learning
Semantics
Shape
Shape Deformation Planning
Strain
Task analysis
Three-dimensional displays

Access to Document

10.1109/LRA.2021.3074872

09410363Accepted author manuscript, 3.32 MB

Cite this

@article{7b69b5ac45994150994e631a54742cc8,

title = "LaSeSOM: A Latent and Semantic Representation Framework for Soft Object Manipulation",

abstract = "Soft object manipulation has recently gained popularity within the robotics community due to its potential applications in many economically important areas. Although great progress has been recently achieved in these types of tasks, most state-of-the-art methods are case-specific; They can only be used to perform a single deformation task (e.g. bending), as their shape representation algorithms typically rely on {"}hard-coded{"} features. In this paper, we present LaSeSOM, a new feedback latent representation framework for semantic soft object manipulation. Our new method introduces internal latent representation layers between low-level geometric feature extraction and high-level semantic shape analysis; This allows the identification of each compressed semantic function and the formation of a valid shape classifier from different feature extraction levels. The proposed latent framework makes soft object representation more generic (independent from the object's geometry and its mechanical properties) and scalable (it can work with 1D/2D/3D tasks). Its high-level semantic layer enables to perform (quasi) shape planning tasks with soft objects, a valuable and underexplored capability in many soft manipulation tasks. To validate this new methodology, we report a detailed experimental study with robotic manipulators.",

keywords = "Bimanual Manipulation, Geodesic Interpolation, Latent Space and Manifolds, Manifolds, Planning, Representation Learning, Semantics, Shape, Shape Deformation Planning, Strain, Task analysis, Three-dimensional displays",

author = "Peng Zhou and Jihong Zhu and Shengzeng Huo and David Navarro-Alarcon",

note = "Accepted Author Manuscript",

year = "2021",

doi = "10.1109/LRA.2021.3074872",

language = "English",

volume = "6",

pages = "5381--5388",

journal = "IEEE Robotics and Automation Letters",

issn = "2377-3766",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "3",

}

TY - JOUR

T1 - LaSeSOM

T2 - A Latent and Semantic Representation Framework for Soft Object Manipulation

AU - Zhou, Peng

AU - Zhu, Jihong

AU - Huo, Shengzeng

AU - Navarro-Alarcon, David

N1 - Accepted Author Manuscript

PY - 2021

Y1 - 2021

N2 - Soft object manipulation has recently gained popularity within the robotics community due to its potential applications in many economically important areas. Although great progress has been recently achieved in these types of tasks, most state-of-the-art methods are case-specific; They can only be used to perform a single deformation task (e.g. bending), as their shape representation algorithms typically rely on "hard-coded" features. In this paper, we present LaSeSOM, a new feedback latent representation framework for semantic soft object manipulation. Our new method introduces internal latent representation layers between low-level geometric feature extraction and high-level semantic shape analysis; This allows the identification of each compressed semantic function and the formation of a valid shape classifier from different feature extraction levels. The proposed latent framework makes soft object representation more generic (independent from the object's geometry and its mechanical properties) and scalable (it can work with 1D/2D/3D tasks). Its high-level semantic layer enables to perform (quasi) shape planning tasks with soft objects, a valuable and underexplored capability in many soft manipulation tasks. To validate this new methodology, we report a detailed experimental study with robotic manipulators.

AB - Soft object manipulation has recently gained popularity within the robotics community due to its potential applications in many economically important areas. Although great progress has been recently achieved in these types of tasks, most state-of-the-art methods are case-specific; They can only be used to perform a single deformation task (e.g. bending), as their shape representation algorithms typically rely on "hard-coded" features. In this paper, we present LaSeSOM, a new feedback latent representation framework for semantic soft object manipulation. Our new method introduces internal latent representation layers between low-level geometric feature extraction and high-level semantic shape analysis; This allows the identification of each compressed semantic function and the formation of a valid shape classifier from different feature extraction levels. The proposed latent framework makes soft object representation more generic (independent from the object's geometry and its mechanical properties) and scalable (it can work with 1D/2D/3D tasks). Its high-level semantic layer enables to perform (quasi) shape planning tasks with soft objects, a valuable and underexplored capability in many soft manipulation tasks. To validate this new methodology, we report a detailed experimental study with robotic manipulators.

KW - Bimanual Manipulation

KW - Geodesic Interpolation

KW - Latent Space and Manifolds

KW - Manifolds

KW - Planning

KW - Representation Learning

KW - Semantics

KW - Shape

KW - Shape Deformation Planning

KW - Strain

KW - Task analysis

KW - Three-dimensional displays

UR - http://www.scopus.com/inward/record.url?scp=85104654505&partnerID=8YFLogxK

U2 - 10.1109/LRA.2021.3074872

DO - 10.1109/LRA.2021.3074872

M3 - Article

AN - SCOPUS:85104654505

SN - 2377-3766

VL - 6

SP - 5381

EP - 5388

JO - IEEE Robotics and Automation Letters

JF - IEEE Robotics and Automation Letters

IS - 3

ER -

LaSeSOM: A Latent and Semantic Representation Framework for Soft Object Manipulation

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this