Tagged articles

GRU

16 articles · Page 1 of 1

Mar 2, 2026 · Artificial Intelligence

MIT's Pichia-CLM model learns yeast DNA language, boosting protein yield up to 3‑fold

A MIT research team introduced Pichia-CLM, a GRU‑based language model trained on a 27 k‑pair Pichia pastoris dataset that optimizes codon usage, and demonstrated across six proteins that it consistently outperforms four commercial codon‑optimization tools, delivering up to a three‑fold increase in heterologous protein secretion.

Deep LearningGRUPichia pastoris

0 likes · 13 min read

MIT's Pichia-CLM model learns yeast DNA language, boosting protein yield up to 3‑fold

AI Cyberspace

Feb 11, 2026 · Artificial Intelligence

From RNNs to LSTMs and GRUs: A Hands‑On Guide to Sequence Modeling in PyTorch

This tutorial explains the nature of sequential data, why traditional feed‑forward networks struggle with it, and how recurrent architectures such as RNN, LSTM, and GRU capture temporal dependencies, complete with mathematical foundations, training algorithms, and full PyTorch implementations for sentiment analysis, text generation, and encoder‑decoder models.

Encoder-DecoderGRULSTM

0 likes · 57 min read

From RNNs to LSTMs and GRUs: A Hands‑On Guide to Sequence Modeling in PyTorch

Data Party THU

Jan 25, 2026 · Big Data

How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU

The Tsinghua University Big Data Capability Project applied GRU‑based deep learning, pulse‑event encoding, and advanced feature engineering to transform discrete refinery energy data into continuous sequences, achieving prediction accuracies of 84.2%, 82.7% and 81.6% for fuel gas, medium‑pressure and low‑pressure steam respectively.

GRUenergy predictionfeature engineering

0 likes · 9 min read

How Tsinghua’s Big Data Initiative Boosted Refinery Energy Forecasts with GRU

Bighead's Algorithm Notes

Dec 2, 2025 · Artificial Intelligence

Dual-Relation Fusion Network (DRFN) for Accurate Stock Prediction

The paper introduces DRFN, a dual‑relation fusion network that jointly models static and dynamic stock relationships using multimodal BERT and GRU encodings, achieving significantly lower RMSE and MAE than baseline models on both Chinese and US market datasets.

BERTGRUGraph Neural Network

0 likes · 11 min read

Dual-Relation Fusion Network (DRFN) for Accurate Stock Prediction

Data Party THU

Sep 9, 2025 · Big Data

Our Winning Strategies for the 2025 China University Big Data Stock Prediction Contest

Competing in the 2025 China University Big Data Challenge, our interdisciplinary team leveraged GRU-based time‑series modeling, XGBoost feature fusion, risk‑based clustering, and a custom asymmetric loss to achieve a fourth‑place finish while highlighting the limitations of relying solely on SHSE 300 stock data.

GRUXGBoostensemble learning

0 likes · 5 min read

Our Winning Strategies for the 2025 China University Big Data Stock Prediction Contest

Qborfy AI

Aug 7, 2025 · Artificial Intelligence

Understanding RNNs: From Memory Cells to Real‑World Applications

This article explains how recurrent neural networks (RNNs) add memory to neural models, details the gate mechanisms of LSTM and GRU, compares their structures and parameter counts, and illustrates their use in speech recognition, translation, stock prediction, and video generation, while highlighting practical insights and energy considerations.

AIDeep LearningGRU

0 likes · 5 min read

Understanding RNNs: From Memory Cells to Real‑World Applications

Alibaba Cloud Developer

Aug 6, 2025 · Artificial Intelligence

How Transformers Revolutionize Sequence Modeling: From RNN Limits to Self‑Attention Mastery

This article explains why Transformer models surpass traditional RNN‑based seq2seq architectures by introducing self‑attention, multi‑head attention, and positional encoding, detailing the inner workings of encoders, decoders, and attention mechanisms, and comparing their advantages and limitations across NLP and vision tasks.

GRULSTMRNN

0 likes · 30 min read

How Transformers Revolutionize Sequence Modeling: From RNN Limits to Self‑Attention Mastery

Network Intelligence Research Center (NIRC)

Sep 21, 2023 · Artificial Intelligence

RVM: Real-Time High-Resolution Video Matting

The article reviews the paper "Robust High-Resolution Video Matting with Temporal Guidance", detailing a GRU‑based multi‑task network that achieves real‑time performance on 4K (76 FPS) and 1080p (104 FPS) video by leveraging temporal information and semantic segmentation.

GRUMobileNetV3Multi-Task Learning

0 likes · 5 min read

RVM: Real-Time High-Resolution Video Matting

DataFunTalk

Apr 3, 2023 · Artificial Intelligence

Implementing RNN, LSTM, and GRU with PyTorch

This article introduces the basic architectures of recurrent neural networks (RNN), LSTM, and GRU, explains PyTorch APIs such as nn.RNN, nn.LSTM, nn.GRU, details their parameters, demonstrates code examples for building and testing these models, and provides practical insights for deep learning practitioners.

Deep LearningGRULSTM

0 likes · 9 min read

Implementing RNN, LSTM, and GRU with PyTorch

Douyu Streaming

Oct 20, 2021 · Artificial Intelligence

How DeepXi and MHANet Revolutionize Speech Enhancement with Multi‑Head Attention

DeepXi introduces a two‑stage deep learning framework for speech enhancement, using prior SNR estimation and MMSE gain, while the MHANet extension leverages multi‑head attention to model long‑range dependencies, with detailed training strategies, model compression to GRU, deployment via TFLite, and impressive low‑latency results.

Deep LearningGRUMulti-Head Attention

0 likes · 8 min read

How DeepXi and MHANet Revolutionize Speech Enhancement with Multi‑Head Attention

58 Tech

Oct 12, 2021 · Artificial Intelligence

Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues

This article presents a practical study of extracting phone numbers from two‑speaker voice dialogues using Seq2Seq models—including LSTM, GRU with attention and feature fusion, and Transformer—detailing data characteristics, model architectures, training strategies, experimental results, and comparative analysis showing the GRU‑Attention approach achieving the best performance.

GRULSTMNLP

0 likes · 13 min read

Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues

TiPaiPai Technical Team

Jul 9, 2021 · Artificial Intelligence

How Multi‑Scale Attention and DenseNet Boost Handwritten Math Expression Recognition

This article reviews a CVPR 2018 paper that introduces a dense‑connected encoder and multi‑scale attention mechanism to improve handwritten mathematical expression recognition, detailing the background, network architecture, GRU decoder, loss function, and experimental gains over previous methods.

DenseNetGRUcomputer vision

0 likes · 8 min read

How Multi‑Scale Attention and DenseNet Boost Handwritten Math Expression Recognition

DataFunTalk

Dec 13, 2019 · Artificial Intelligence

Fundamentals of Deep Learning: Neural Networks, CNNs, RNNs, LSTM, and GRU

This article provides a comprehensive overview of deep learning fundamentals, covering neural network basics, forward and backward feedback architectures, key models such as MLP, CNN, RNN, LSTM and GRU, training techniques like gradient descent, learning rate schedules, momentum, weight decay, and batch normalization.

CNNDeep LearningGRU

0 likes · 14 min read

Fundamentals of Deep Learning: Neural Networks, CNNs, RNNs, LSTM, and GRU

Xianyu Technology

May 10, 2018 · Artificial Intelligence

Mercari Price Prediction Using TFIDF, GRU, and Ensemble Models

By converting Mercari’s product titles, descriptions, and categorical data into TF‑IDF vectors and embeddings, training MLP and GRU networks, and ensembling them with weighted averaging, the authors achieve a 0.3873 RMSLE—matching the competition’s top score—and demonstrate the power of text‑only price prediction for C2C marketplaces.

EnsembleGRUTFIDF

0 likes · 8 min read

Mercari Price Prediction Using TFIDF, GRU, and Ensemble Models

Liulishuo Tech Team

Aug 11, 2017 · Artificial Intelligence

DeepGrammar: A Neural Network Approach for Grammatical Error Detection and Correction

DeepGrammar is a bidirectional GRU‑based neural model that detects subject‑verb agreement errors by encoding surrounding context into fixed‑length vectors, outperforming rule‑based, classifier, and NMT approaches on the CoNLL‑2014 benchmark and achieving state‑of‑the‑art results across multiple error types.

GRUgrammar correctionneural networks

0 likes · 8 min read

DeepGrammar: A Neural Network Approach for Grammatical Error Detection and Correction

dbaplus Community

Jan 19, 2017 · Artificial Intelligence

Mastering LSTM: Architecture, Forward/Backward Computation, and Implementation

This article provides a comprehensive guide to Long Short-Term Memory networks, covering their motivation, detailed forward and backward equations, gate mechanisms, training algorithm, gradient checking, and a full C++ implementation, while also introducing the simpler GRU variant.

BackpropagationDeep LearningGRU

0 likes · 12 min read

Mastering LSTM: Architecture, Forward/Backward Computation, and Implementation