User profiles for Ting Yao
![]() | Ting YaoHiDream.ai, previously JD.com and Microsoft Research Verified email at hidream.ai Cited by 19663 |
Learning spatio-temporal representation with pseudo-3d residual networks
Convolutional Neural Networks (CNN) have been regarded as a powerful class of models for
image recognition problems. Nevertheless, it is not trivial when utilizing a CNN for learning …
image recognition problems. Nevertheless, it is not trivial when utilizing a CNN for learning …
Video captioning with transferred semantic attributes
Automatically generating natural language descriptions of videos plays a fundamental
challenge for computer vision community. Most recent progress in this problem has been …
challenge for computer vision community. Most recent progress in this problem has been …
Msr-vtt: A large video description dataset for bridging video and language
While there has been increasing interest in the task of describing video with natural language,
current computer vision algorithms are still severely limited in terms of the variability and …
current computer vision algorithms are still severely limited in terms of the variability and …
Memory matching networks for one-shot image recognition
In this paper, we introduce the new ideas of augmenting Convolutional Neural Networks (CNNs)
with Memory and learning to learn the network parameters for the unlabelled images on …
with Memory and learning to learn the network parameters for the unlabelled images on …
Contextual transformer networks for visual recognition
Transformer with self-attention has led to the revolutionizing of natural language processing
field, and recently inspires the emergence of Transformer-style architecture design with …
field, and recently inspires the emergence of Transformer-style architecture design with …
Exploring visual relationship for image captioning
It is always well believed that modeling relationships between objects would be helpful for
representing and eventually describing an image. Nevertheless, there has not been evidence …
representing and eventually describing an image. Nevertheless, there has not been evidence …
Boosting image captioning with attributes
Automatically describing an image with a natural language has been an emerging challenge
in both fields of computer vision and natural language processing. In this paper, we present …
in both fields of computer vision and natural language processing. In this paper, we present …
X-linear attention networks for image captioning
Recent progress on fine-grained visual recognition and visual question answering has
featured Bilinear Pooling, which effectively models the 2nd order interactions across multi-modal …
featured Bilinear Pooling, which effectively models the 2nd order interactions across multi-modal …
Jointly modeling embedding and translation to bridge video and language
Automatically describing video content with natural language is a fundamental challenge of
computer vision. Recurrent Neural Networks (RNNs), which models sequence dynamics, …
computer vision. Recurrent Neural Networks (RNNs), which models sequence dynamics, …
Adjacent Copper Single Atoms Promote C–C Coupling in Electrochemical CO2 Reduction for the Efficient Conversion of Ethanol
…, S Jia, S Han, R Qi, T Chen, X Xing, T Yao… - Journal of the …, 2023 - ACS Publications
The electrochemical CO 2 reduction reaction (CO 2 RR) using renewable electricity is one of
the most promising strategies for reaching the goal of carbon neutrality. Multicarbonous (C 2…
the most promising strategies for reaching the goal of carbon neutrality. Multicarbonous (C 2…