Keyword search (4,163 papers available)

"Zhang H" Authored Publications:

Title Authors PubMed ID
1 Unraveling the resuspension and transformation of stranded oil: Mechanisms driving oil-particle aggregate formation in intertidal zones Yang X; Bi H; Huang G; Zhang H; Lyu L; An C; 40544777
ENCS
2 Semantically-Enhanced Feature Extraction with CLIP and Transformer Networks for Driver Fatigue Detection Gao Z; Chen X; Xu J; Yu R; Zhang H; Yang J; 39771685
ENCS
3 Reduction of Cr(VI) by Bacillus toyonensis LBA36 and its effect on radish seedlings under Cr(VI) stress Tan A; Wang H; Zhang H; Zhang L; Yao H; Chen Z; 39346031
ENCS
4 Inoculation of chromium-tolerant bacterium LBA108 to enhance resistance in radish (Raphanus sativus L.) and combined remediation of chromium-contaminated soil Zhang H; Wang H; Tan A; Zhang L; Yao H; You X; Chen Z; 38721825
ENCS
5 Cooperative Sensitization Upconversion in Solution Dispersions of Co-Crystal Assemblies of Mononuclear Yb3+ and Eu3+ Complexes Sun G; Xie Y; Wang Y; Mandl GA; Maurizio SL; Zhang H; Ottenwaelder X; Capobianco JA; Sun L; 37040148
CNSR
6 Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations Chen Q; Allot A; Leaman R; Islamaj R; Du J; Fang L; Wang K; Xu S; Zhang Y; Bagherzadeh P; Bergler S; Bhatnagar A; Bhavsar N; Chang YC; Lin SJ; Tang W; Zhang H; Tavchioski I; Pollak S; Tian S; Zhang J; Otmakhova Y; Yepes AJ; Dong H; Wu H; Dufour R; Labrak Y; Chatterjee N; Tandon K; Laleye FAA; Rakotoson L; Chersoni E; Gu J; Friedrich A; Pujari SC; Chizhikova M; Sivadasan N; Vg S; Lu Z; 36043400
ENCS

 

Title:Semantically-Enhanced Feature Extraction with CLIP and Transformer Networks for Driver Fatigue Detection
Authors:Gao ZChen XXu JYu RZhang HYang J
Link:https://pubmed.ncbi.nlm.nih.gov/39771685/
DOI:10.3390/s24247948
Publication:Sensors (Basel, Switzerland)
Keywords:CLIP pre-trained modelTransformerfatigue detectioninstance normalizationsemantic analysis
PMID:39771685 Category: Date Added:2025-01-08
Dept Affiliation: ENCS
1 School of Computer Science and Technology, Tongji University, Shanghai 201804, China.
2 Department of Computer Science, City University of Hong Kong, Hong Kong 999077, China.
3 Key Laboratory of Road and Traffic Engineering of the Ministry of Education, Shanghai 201804, China.
4 College of Transportation Engineering, Tongji University, Shanghai 201804, China.
5 Zhejiang Fengxing Huiyun Technology Co., Ltd., Hangzhou 311107, China.
6 Department of Computer Science and Software Engineering, Concordia University, Montreal, QC H3G 1M8, Canada.

Description:

Drowsy driving is a leading cause of commercial vehicle traffic crashes. The trend is to train fatigue detection models using deep neural networks on driver video data, but challenges remain in coarse and incomplete high-level feature extraction and network architecture optimization. This paper pioneers the use of the CLIP (Contrastive Language-Image Pre-training) model for fatigue detection. And by harnessing the power of a Transformer architecture, sophisticated and long-term temporal features are adeptly extracted from video sequences, paving the way for more nuanced and accurate fatigue analysis. The proposed CT-Net (CLIP-Transformer Network) achieves an AUC (Area Under the Curve) of 0.892, a 36% accuracy improvement over the prevalent CNN-LSTM (Convolutional Neural Network-Long Short-Term Memory) end-to-end model, reaching state-of-the-art performance. Experiments show that the CLIP pre-trained model more accurately extracts facial and behavioral features from driver video frames, improving the model's AUC by 7% over the ImageNet-based pre-trained model. Moreover, compared with LSTM, the Transformer more flexibly captures long-term dependencies among temporal features, further enhancing the model's AUC by 4%.





BookR developed by Sriram Narayanan
for the Concordia University School of Health
Copyright © 2011-2026
Cookie settings
Concordia University