您的位置: 专家智库 > >

国家自然科学基金(61231015)

作品数:28 被引量:50H指数:3
相关作者:胡瑞敏杨玉红王晖涂卫平陈军更多>>
相关机构:武汉大学河南师范大学中国传媒大学更多>>
发文基金:国家自然科学基金国家高技术研究发展计划湖北省自然科学基金更多>>
相关领域:自动化与计算机技术电子电信文化科学更多>>

文献类型

  • 27篇期刊文章
  • 3篇会议论文

领域

  • 15篇自动化与计算...
  • 14篇电子电信
  • 1篇文化科学

主题

  • 5篇音频
  • 4篇BASED_...
  • 3篇信号
  • 3篇音频编码
  • 2篇信号处理
  • 2篇音频信号
  • 2篇音频信号处理
  • 2篇声源
  • 2篇声源定位
  • 2篇平移
  • 2篇近场
  • 2篇范数
  • 2篇MULTIP...
  • 2篇ROBUST
  • 2篇STRUCT...
  • 2篇CODING
  • 2篇L1范数
  • 1篇带宽
  • 1篇带宽扩展
  • 1篇到达角

机构

  • 12篇武汉大学
  • 3篇中国传媒大学
  • 3篇河南师范大学
  • 2篇湖北经济学院
  • 2篇东华理工大学
  • 2篇湖北大学
  • 1篇桂林电子科技...

作者

  • 7篇胡瑞敏
  • 3篇杨玉红
  • 3篇涂卫平
  • 3篇王晖
  • 2篇姜林
  • 2篇项慨
  • 2篇李彩容
  • 2篇陈军
  • 1篇张茂胜
  • 1篇江俊君
  • 1篇谭小琼
  • 1篇李登实
  • 1篇徐增敏
  • 1篇高戈
  • 1篇梁超
  • 1篇董小慧
  • 1篇张勤
  • 1篇吕海涛
  • 1篇韩镇
  • 1篇傅佑铭

传媒

  • 7篇Wuhan ...
  • 5篇China ...
  • 2篇计算机工程
  • 2篇计算机应用与...
  • 1篇自动化学报
  • 1篇数据采集与处...
  • 1篇武汉大学学报...
  • 1篇湖北大学学报...
  • 1篇武汉大学学报...
  • 1篇计算机应用
  • 1篇电子科技大学...
  • 1篇计算机应用研...
  • 1篇小型微型计算...
  • 1篇中国传媒大学...
  • 1篇当代继续教育

年份

  • 1篇2019
  • 2篇2018
  • 11篇2017
  • 7篇2016
  • 3篇2015
  • 4篇2014
  • 2篇2013
28 条 记 录,以下是 1-10
排序方式:
信息隐藏技术在档案馆社会化服务中的应用
2013年
在档案馆社会化服务中,档案信息通过互连网提供给广大用户,信息安全是最关键的问题之一。信息隐藏技术是日趋成熟的信息安全技术之一,在许多领域已经得到有效的应用。本文探讨了在档案馆社会化服务中如何应用信息隐藏技术来保障档案信息安全。
李彩容胡瑞敏涂卫平
关键词:信息隐藏数字水印信息安全
一种移动音频编码自适应丢帧隐藏算法
2014年
针对主流移动音频编码器AVS-P10所采用的语音丢帧隐藏算法在不同丢帧的情况下语音恢复效果存在的不足,提出了一种自适应的语音丢帧隐藏算法,该算法根据不同的坏帧模式自适应地采取不同的坏帧恢复方法,对坏帧中的基音周期参数和ISF系数充分利用前后分别收到的好帧采用更优化的线性预测方法恢复.实验表明:与AVS-P10中的丢帧隐藏算法相比,本文方法在坏帧率低于3%时有更好的语音质量恢复效果.
项慨胡瑞敏
关键词:音频信号处理
基于距离差、能量比和到达角的两麦克风声源定位
2016年
减少定位声源所需麦克风数量对于特定场合是重要的,提出在两麦克风条件下利用距离差、能量比和到达角联合对单个声源进行二维定位的最大似然法和一种闭式解。在高斯噪声条件下,最大似然法将转化为加权最小二乘问题,为减小求解复杂度利用了粒子群算法进行迭代求解。通过仿真分析了最大似然法和闭式解法的抗噪声性能。
窦育强王晖张勤
关键词:到达角最大似然粒子群
Multiple Feature Fusion in Convolutional Neural Networks for Action Recognition被引量:5
2017年
Action recognition is important for understanding the human behaviors in the video,and the video representation is the basis for action recognition.This paper provides a new video representation based on convolution neural networks(CNN).For capturing human motion information in one CNN,we take both the optical flow maps and gray images as input,and combine multiple convolutional features by max pooling across frames.In another CNN,we input single color frame to capture context information.Finally,we take the top full connected layer vectors as video representation and train the classifiers by linear support vector machine.The experimental results show that the representation which integrates the optical flow maps and gray images obtains more discriminative properties than those which depend on only one element.On the most challenging data sets HMDB51 and UCF101,this video representation obtains competitive performance.
LI HongyangCHEN JunHU Ruimin
Spatial Perception Reproduction of Sound Event Based on Sound Properties
2015年
A new method for estimating gain factors in amplitude panning system is proposed. The method is based on particle ve- locity and balanced sound energy formulation. A scale factor is employed in amplitude panning system and thus, an overdeter- mined system of equation is derived in particle velocity equation. To obtain the analytic solution of the overdetermined equation, the sound energy identical formula is considered and then the unique gain factors are estimated. The proposed method is able to repro- duce sound source direction and control the distance perception in a flexible twoor three-dimension loudspeaker setup. Subjective evaluations show that the proposed technique in an aspheric loudspeaker setup maintains the sound direction and controls the distance perception at the listening point.
ZHANG MaoshengHU RuiminCHEN ShihongWANG XiaochenJIANG LinWANG Heng
关键词:REPRODUCTIONPERCEPTION
Action Recognition with Temporal Scale-Invariant Deep Learning Framework被引量:1
2017年
Recognizing actions according to video features is an important problem in a wide scope of applications. In this paper, we propose a temporal scale.invariant deep learning framework for action recognition, which is robust to the change of action speed. Specifically, a video is firstly split into several sub.action clips and a keyframe is selected from each sub.action clip. The spatial and motion features of the keyframe are extracted separately by two Convolutional Neural Networks(CNN) and combined in the convolutional fusion layer for learning the relationship between the features. Then, Long Short Term Memory(LSTM) networks are applied to the fused features to formulate long.term temporal clues. Finally, the action prediction scores of the LSTM network are combined by linear weighted summation. Extensive experiments are conducted on two popular and challenging benchmarks, namely, the UCF.101 and the HMDB51 Human Actions. On both benchmarks, our framework achieves superior results over the state.of.the.art methods by 93.7% on UCF.101 and 69.5% on HMDB51, respectively.
Huafeng ChenJun ChenRuimin HuChen ChenZhongyuan Wang
关键词:CNN
基于奇异值分解的稀疏重构近场声源定位
2016年
针对近场声源定位问题,提出一种基于奇异值分解的稀疏重构定位方法。该方法通过奇异值分解得到信号子空间,然后在信号子空间约束l1范数求解优化问题实现声源的定位。与直接对接收信号进行稀疏重构相比,该方法通过奇异值分解降低了计算量,有效抑制了噪声。仿真结果表明,与子空间方法相比,其提高了定位的抗噪声性能和分辨率。
窦育强王晖马赛
关键词:近场声源定位L1范数奇异值分解
Robust Multiple Sound Source Localization in Noisy Environment by Using a Soundfield Microphone
Sound source localization techniques are becoming popular as they provide an effective information for paramet...
Jundai SunMaoshen JiaChangchun Bao
关键词:SPARSITY
Non-Central Zone 3D Sound Field Reproduction for Multichannel System
2017年
The 22.2 multichannel system and its simplified system with 10-channel and 8-channel have been proposed, which brings people 3 D listening experience. But these systems could only accurately reproduce sound field at a central listening point which is called sweetspot. In order to solve this problem, this paper proposes a non-central zone sound field reproduction method PVMDZ(particle velocity matching between different zones) based on the physical property of sound. The proposed method matches the physical property of sound of non-central zone in reconstructed sound field with that of central zone in original sound field, so the reproduced non-central zone would produce the same listening experience as the central zone of the original system does. By experiments, we compare the performances of the proposed method with the traditional one, and the result proves that the sound field error of proposed method is reduced.
WANG SongZHANG CongPENG BoWANG HengHU Ruimin
面向应急调度的多媒体网关通信模式被引量:3
2015年
针对现有文献侧重单一网关功能实现的问题,研究多媒体网关共性和底层通信模式.以应急调度系统业务流程为例,分析了网关数据通信的多任务并发处理流程,设计了一套适合多媒体网关软件开发的通信模式.该模式将多任务分配到线程池的空闲线程队列并发执行,以插件形式封装外部设备,在异构网络环境中实现了媒体设备之间的异步通信.通过会议调度实例指出了IP语音质量优化的可行方案,实验结果表明,以该模式研发的语音网关能很好满足应急调度功能要求,实时响应性能良好.
徐增敏胡瑞敏陈军禹玮傅佑铭李俊
关键词:应急通信多任务语音网关回调函数
共3页<123>
聚类工具0