Super-resolution for sequence series data using long-short term memory network

Pak Kan WONG, Man Leung WONG, Kwong Sak LEUNG

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

Abstract

Sequence Series Data (SSD) refers to multi-dimensional data involving measurements over sequences, which can be ordered. This type of data is frequently encountered in genomic data sets and text sentiment analysis data sets, but collecting them can be time-consuming and labour-intensive. These factors result in low-resolution data sets. Therefore, we employed six machine learning regression methods to perform SSD super-resolution, i.e. to recover high-resolution data sets using self-similarity in low-resolution data sets. Furthermore, we propose a novel Long-Short Term Memory (LSTM) network, namely Interaction Encoded LSTM (IELSTM) network, which is capable of handling multiple distant interactions among sequences. IELSTM network generally shows better overall reconstruction quality when compared with ridge regression, LASSO regression, orthogonal matching pursuit regression, multilayer perceptron regression, and random forest regression, on four genomic data sets.

Original languageEnglish
Title of host publication2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-8
Number of pages8
Volume2018-January
ISBN (Electronic)9781538627259
DOIs
Publication statusPublished - 2 Feb 2018
Event2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - United States, Honolulu, United States
Duration: 27 Nov 20171 Dec 2017

Conference

Conference2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017
CountryUnited States
CityHonolulu
Period27/11/171/12/17
OtherIEEE

Fingerprint

Memory Term
Super-resolution
Series
Multilayer neural networks
Learning systems
Personnel
Regression
Genomics
Orthogonal Regression
Interaction
Text Analysis
Sentiment Analysis
Long short-term memory
Matching Pursuit
Ridge Regression
Multidimensional Data
Random Forest
Self-similarity
Perceptron
Multilayer

Cite this

WONG, P. K., WONG, M. L., & LEUNG, K. S. (2018). Super-resolution for sequence series data using long-short term memory network. In 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings (Vol. 2018-January, pp. 1-8). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/SSCI.2017.8280957
WONG, Pak Kan ; WONG, Man Leung ; LEUNG, Kwong Sak. / Super-resolution for sequence series data using long-short term memory network. 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. pp. 1-8
@inproceedings{3a73fd173c7b4c45a1bcca4956192cb7,
title = "Super-resolution for sequence series data using long-short term memory network",
abstract = "Sequence Series Data (SSD) refers to multi-dimensional data involving measurements over sequences, which can be ordered. This type of data is frequently encountered in genomic data sets and text sentiment analysis data sets, but collecting them can be time-consuming and labour-intensive. These factors result in low-resolution data sets. Therefore, we employed six machine learning regression methods to perform SSD super-resolution, i.e. to recover high-resolution data sets using self-similarity in low-resolution data sets. Furthermore, we propose a novel Long-Short Term Memory (LSTM) network, namely Interaction Encoded LSTM (IELSTM) network, which is capable of handling multiple distant interactions among sequences. IELSTM network generally shows better overall reconstruction quality when compared with ridge regression, LASSO regression, orthogonal matching pursuit regression, multilayer perceptron regression, and random forest regression, on four genomic data sets.",
author = "WONG, {Pak Kan} and WONG, {Man Leung} and LEUNG, {Kwong Sak}",
year = "2018",
month = "2",
day = "2",
doi = "10.1109/SSCI.2017.8280957",
language = "English",
volume = "2018-January",
pages = "1--8",
booktitle = "2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

WONG, PK, WONG, ML & LEUNG, KS 2018, Super-resolution for sequence series data using long-short term memory network. in 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings. vol. 2018-January, Institute of Electrical and Electronics Engineers Inc., pp. 1-8, 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017, Honolulu, United States, 27/11/17. https://doi.org/10.1109/SSCI.2017.8280957

Super-resolution for sequence series data using long-short term memory network. / WONG, Pak Kan; WONG, Man Leung; LEUNG, Kwong Sak.

2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings. Vol. 2018-January Institute of Electrical and Electronics Engineers Inc., 2018. p. 1-8.

Research output: Book Chapters | Papers in Conference ProceedingsConference paper (refereed)Researchpeer-review

TY - GEN

T1 - Super-resolution for sequence series data using long-short term memory network

AU - WONG, Pak Kan

AU - WONG, Man Leung

AU - LEUNG, Kwong Sak

PY - 2018/2/2

Y1 - 2018/2/2

N2 - Sequence Series Data (SSD) refers to multi-dimensional data involving measurements over sequences, which can be ordered. This type of data is frequently encountered in genomic data sets and text sentiment analysis data sets, but collecting them can be time-consuming and labour-intensive. These factors result in low-resolution data sets. Therefore, we employed six machine learning regression methods to perform SSD super-resolution, i.e. to recover high-resolution data sets using self-similarity in low-resolution data sets. Furthermore, we propose a novel Long-Short Term Memory (LSTM) network, namely Interaction Encoded LSTM (IELSTM) network, which is capable of handling multiple distant interactions among sequences. IELSTM network generally shows better overall reconstruction quality when compared with ridge regression, LASSO regression, orthogonal matching pursuit regression, multilayer perceptron regression, and random forest regression, on four genomic data sets.

AB - Sequence Series Data (SSD) refers to multi-dimensional data involving measurements over sequences, which can be ordered. This type of data is frequently encountered in genomic data sets and text sentiment analysis data sets, but collecting them can be time-consuming and labour-intensive. These factors result in low-resolution data sets. Therefore, we employed six machine learning regression methods to perform SSD super-resolution, i.e. to recover high-resolution data sets using self-similarity in low-resolution data sets. Furthermore, we propose a novel Long-Short Term Memory (LSTM) network, namely Interaction Encoded LSTM (IELSTM) network, which is capable of handling multiple distant interactions among sequences. IELSTM network generally shows better overall reconstruction quality when compared with ridge regression, LASSO regression, orthogonal matching pursuit regression, multilayer perceptron regression, and random forest regression, on four genomic data sets.

UR - http://www.scopus.com/inward/record.url?scp=85046131786&partnerID=8YFLogxK

U2 - 10.1109/SSCI.2017.8280957

DO - 10.1109/SSCI.2017.8280957

M3 - Conference paper (refereed)

AN - SCOPUS:85046131786

VL - 2018-January

SP - 1

EP - 8

BT - 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

WONG PK, WONG ML, LEUNG KS. Super-resolution for sequence series data using long-short term memory network. In 2017 IEEE Symposium Series on Computational Intelligence, SSCI 2017 - Proceedings. Vol. 2018-January. Institute of Electrical and Electronics Engineers Inc. 2018. p. 1-8 https://doi.org/10.1109/SSCI.2017.8280957