公共文化服务平台

信息隐藏技术在档案馆社会化服务中的应用: 2013年; 在档案馆社会化服务中,档案信息通过互连网提供给广大用户,信息安全是最关键的问题之一。信息隐藏技术是日趋成熟的信息安全技术之一,在许多领域已经得到有效的应用。本文探讨了在档案馆社会化服务中如何应用信息隐藏技术来保障档案信息安全。; 李彩容胡瑞敏涂卫平; 关键词：信息隐藏数字水印信息安全

一种移动音频编码自适应丢帧隐藏算法: 2014年; 针对主流移动音频编码器AVS-P10所采用的语音丢帧隐藏算法在不同丢帧的情况下语音恢复效果存在的不足,提出了一种自适应的语音丢帧隐藏算法,该算法根据不同的坏帧模式自适应地采取不同的坏帧恢复方法,对坏帧中的基音周期参数和ISF系数充分利用前后分别收到的好帧采用更优化的线性预测方法恢复.实验表明:与AVS-P10中的丢帧隐藏算法相比,本文方法在坏帧率低于3%时有更好的语音质量恢复效果.; 项慨胡瑞敏; 关键词：音频信号处理

基于距离差、能量比和到达角的两麦克风声源定位: 2016年; 减少定位声源所需麦克风数量对于特定场合是重要的,提出在两麦克风条件下利用距离差、能量比和到达角联合对单个声源进行二维定位的最大似然法和一种闭式解。在高斯噪声条件下,最大似然法将转化为加权最小二乘问题,为减小求解复杂度利用了粒子群算法进行迭代求解。通过仿真分析了最大似然法和闭式解法的抗噪声性能。; 窦育强王晖张勤; 关键词：到达角最大似然粒子群

Multiple Feature Fusion in Convolutional Neural Networks for Action Recognition被引量：5: 2017年; Action recognition is important for understanding the human behaviors in the video,and the video representation is the basis for action recognition.This paper provides a new video representation based on convolution neural networks（CNN）.For capturing human motion information in one CNN,we take both the optical flow maps and gray images as input,and combine multiple convolutional features by max pooling across frames.In another CNN,we input single color frame to capture context information.Finally,we take the top full connected layer vectors as video representation and train the classifiers by linear support vector machine.The experimental results show that the representation which integrates the optical flow maps and gray images obtains more discriminative properties than those which depend on only one element.On the most challenging data sets HMDB51 and UCF101,this video representation obtains competitive performance.; LI HongyangCHEN JunHU Ruimin

Spatial Perception Reproduction of Sound Event Based on Sound Properties: 2015年; A new method for estimating gain factors in amplitude panning system is proposed. The method is based on particle ve- locity and balanced sound energy formulation. A scale factor is employed in amplitude panning system and thus, an overdeter- mined system of equation is derived in particle velocity equation. To obtain the analytic solution of the overdetermined equation, the sound energy identical formula is considered and then the unique gain factors are estimated. The proposed method is able to repro- duce sound source direction and control the distance perception in a flexible twoor three-dimension loudspeaker setup. Subjective evaluations show that the proposed technique in an aspheric loudspeaker setup maintains the sound direction and controls the distance perception at the listening point.; ZHANG MaoshengHU RuiminCHEN ShihongWANG XiaochenJIANG LinWANG Heng; 关键词：REPRODUCTION PERCEPTION

Action Recognition with Temporal Scale-Invariant Deep Learning Framework被引量：1: 2017年; Recognizing actions according to video features is an important problem in a wide scope of applications. In this paper, we propose a temporal scale.invariant deep learning framework for action recognition, which is robust to the change of action speed. Specifically, a video is firstly split into several sub.action clips and a keyframe is selected from each sub.action clip. The spatial and motion features of the keyframe are extracted separately by two Convolutional Neural Networks(CNN) and combined in the convolutional fusion layer for learning the relationship between the features. Then, Long Short Term Memory(LSTM) networks are applied to the fused features to formulate long.term temporal clues. Finally, the action prediction scores of the LSTM network are combined by linear weighted summation. Extensive experiments are conducted on two popular and challenging benchmarks, namely, the UCF.101 and the HMDB51 Human Actions. On both benchmarks, our framework achieves superior results over the state.of.the.art methods by 93.7% on UCF.101 and 69.5% on HMDB51, respectively.; Huafeng ChenJun ChenRuimin HuChen ChenZhongyuan Wang; 关键词：CNN

基于奇异值分解的稀疏重构近场声源定位: 2016年; 针对近场声源定位问题,提出一种基于奇异值分解的稀疏重构定位方法。该方法通过奇异值分解得到信号子空间,然后在信号子空间约束l1范数求解优化问题实现声源的定位。与直接对接收信号进行稀疏重构相比,该方法通过奇异值分解降低了计算量,有效抑制了噪声。仿真结果表明,与子空间方法相比,其提高了定位的抗噪声性能和分辨率。; 窦育强王晖马赛; 关键词：近场声源定位 L1范数奇异值分解

Robust Multiple Sound Source Localization in Noisy Environment by Using a Soundfield Microphone: Sound source localization techniques are becoming popular as they provide an effective information for paramet...; Jundai SunMaoshen JiaChangchun Bao; 关键词：SPARSITY

Non-Central Zone 3D Sound Field Reproduction for Multichannel System: 2017年; The 22.2 multichannel system and its simplified system with 10-channel and 8-channel have been proposed, which brings people 3 D listening experience. But these systems could only accurately reproduce sound field at a central listening point which is called sweetspot. In order to solve this problem, this paper proposes a non-central zone sound field reproduction method PVMDZ（particle velocity matching between different zones） based on the physical property of sound. The proposed method matches the physical property of sound of non-central zone in reconstructed sound field with that of central zone in original sound field, so the reproduced non-central zone would produce the same listening experience as the central zone of the original system does. By experiments, we compare the performances of the proposed method with the traditional one, and the result proves that the sound field error of proposed method is reduced.; WANG SongZHANG CongPENG BoWANG HengHU Ruimin

面向应急调度的多媒体网关通信模式被引量：3: 2015年; 针对现有文献侧重单一网关功能实现的问题,研究多媒体网关共性和底层通信模式.以应急调度系统业务流程为例,分析了网关数据通信的多任务并发处理流程,设计了一套适合多媒体网关软件开发的通信模式.该模式将多任务分配到线程池的空闲线程队列并发执行,以插件形式封装外部设备,在异构网络环境中实现了媒体设备之间的异步通信.通过会议调度实例指出了IP语音质量优化的可行方案,实验结果表明,以该模式研发的语音网关能很好满足应急调度功能要求,实时响应性能良好.; 徐增敏胡瑞敏陈军禹玮傅佑铭李俊; 关键词：应急通信多任务语音网关回调函数

渝B2-20050021-1　渝公网安备 50019002500403号　违法和不良信息举报中心　互联网出版许可证　新出网证(渝)字10号

国家自然科学基金(61231015)

文献类型

领域

主题

机构

作者

传媒

年份

用户反馈

国家自然科学基金(61231015)

文献类型

领域

主题

机构

作者

传媒

年份

用户登录

用户反馈