In this paper, a wavelet neural network (WNN) trained by a novel hybrid algorithm is proposed for network anomaly detection. The new evolutionary algorithm, a hybrid of quantum-behaved particle swarm optimization (QPSO) and the conjugate gradient (CG) algorithm, is employed to train the WNN. QPSO, which outperforms many other optimization algorithms thanks to its simple architecture and fast convergence, has previously been applied to optimization problems. However, because the particles seek the best position in the multi-dimensional space so quickly, QPSO is prone to stagnation, which can prevent it from reaching the global optimum. To overcome this defect of QPSO, the improved hybrid algorithm is proposed. Experimental results on the KDD 99 intrusion detection dataset show that the WNN trained with the novel hybrid algorithm achieves a high detection rate while maintaining a low false positive rate.
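As a hedged illustration of the QPSO component this abstract builds on, the sketch below implements the standard quantum-behaved position update on a sphere function standing in for the WNN training loss; the function names, the contraction coefficient beta, and the omission of the conjugate-gradient refinement step are illustrative assumptions rather than the paper's exact training procedure.

```python
import numpy as np

def sphere(x):
    # Stand-in objective for the WNN training loss.
    return float(np.sum(x ** 2))

def qpso(objective, dim=10, n_particles=20, iters=200, beta=0.75, seed=0):
    rng = np.random.default_rng(seed)
    x = rng.uniform(-5.0, 5.0, (n_particles, dim))      # particle positions
    pbest = x.copy()                                     # personal best positions
    pbest_val = np.array([objective(p) for p in pbest])
    for _ in range(iters):
        gbest = pbest[np.argmin(pbest_val)]              # global best position
        mbest = pbest.mean(axis=0)                       # mean best position
        for i in range(n_particles):
            phi = rng.random(dim)
            p = phi * pbest[i] + (1 - phi) * gbest       # local attractor
            u = np.maximum(rng.random(dim), 1e-12)
            sign = np.where(rng.random(dim) < 0.5, -1.0, 1.0)
            # Quantum-behaved position update around the attractor.
            x[i] = p + sign * beta * np.abs(mbest - x[i]) * np.log(1.0 / u)
            val = objective(x[i])
            if val < pbest_val[i]:                       # update personal best
                pbest_val[i], pbest[i] = val, x[i].copy()
    return pbest[np.argmin(pbest_val)], float(pbest_val.min())

best_x, best_val = qpso(sphere)
print(best_val)
```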
Approximate nearest neighbor search is a classical problem in data science that is widely applied in many fields. With the rapid growth of real-world data, speeding up the nearest neighbor search process becomes more and more important. The Satellite System Graph (SSG) is one of the state-of-the-art methods for this problem. However, as the data scale increases further, SSG still needs a considerable amount of time to finish a search because of the limitations of its step length and start point locations. To solve this problem, we propose the Hierarchical Satellite System Graph (HSSG) and present its indexing and search algorithms. The indexing process can be deployed in a distributed fashion thanks to the good parallelism of the designed hierarchical structure. Our theoretical analysis shows that, by searching on the hierarchical structure, HSSG decreases the number of search steps and reduces both the computational cost and the search time, with an indexing time similar to that of SSG, and hence achieves better search efficiency. Experiments on multiple datasets show that HSSG reduces distance computations, accelerates the search process, and increases search precision in real tasks, especially on large-scale tasks with crowded distributions, which indicates a good application prospect for HSSG.
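To make the hierarchical search idea concrete, the sketch below performs a greedy coarse-to-fine descent over layered proximity graphs, the general pattern behind hierarchical graph indexes; the layer layout, neighbor lists, and toy data are illustrative assumptions and not HSSG's actual edge selection or search routine.

```python
import numpy as np

def hierarchical_greedy_search(query, vectors, layers, entry_point):
    """Greedy descent from the top (sparsest) layer down to the base layer."""
    cur = entry_point
    for graph in layers:                       # layers[0] = top, layers[-1] = base
        improved = True
        while improved:                        # greedy walk within one layer
            improved = False
            cur_d = np.linalg.norm(vectors[cur] - query)
            for nb in graph.get(cur, []):
                d = np.linalg.norm(vectors[nb] - query)
                if d < cur_d:                  # move to a strictly closer neighbor
                    cur, cur_d, improved = nb, d, True
    return cur                                 # approximate nearest neighbor id

# Toy usage: a 2-layer hierarchy over five 2-D points (structure is made up).
vectors = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [3.0, 3.0]])
layers = [{0: [4], 4: [0]},                                          # sparse top layer
          {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [3]}]   # base layer
print(hierarchical_greedy_search(np.array([2.1, 1.9]), vectors, layers, 0))
```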
Live forensics is an important technique in cloud security but faces the challenge of reliability. Most live forensic tools in cloud computing run either in the target Operating System (OS) or as an extra hypervisor. Tools in the target OS are not reliable, since they might be deceived by a compromised OS. Furthermore, traditional general-purpose hypervisors are vulnerable due to their huge code size. However, some modules of a general-purpose hypervisor, such as device drivers, are unnecessary for forensics. In this paper, we propose a special-purpose hypervisor, called ForenVisor, which is dedicated to reliable live forensics. Reliability is improved in three ways: reducing the Trusted Computing Base (TCB) size by leveraging a lightweight architecture, collecting evidence directly from the hardware, and protecting the evidence and other sensitive files with the Filesafe module. We have implemented a proof-of-concept prototype on the Windows platform, which can acquire process data, raw memory, and I/O data such as keystrokes and network traffic. Furthermore, we evaluate ForenVisor in terms of code size, functionality, and performance. The experimental results show that ForenVisor has a relatively small TCB size of about 13 KLOC and causes less than a 10 percent performance reduction in the target system. In particular, our experiments verify that ForenVisor can guarantee that the protected files remain untampered, even when the guest OS is compromised by viruses such as 'ILOVEYOU' and Worm.WhBoy. Moreover, our system can be loaded as a hypervisor without pausing the target OS, which allows it not only to avoid destroying but also to gather the live evidence of the target OS. We have also posted the source code of ForenVisor on GitHub.
A software code cache, employed to store translated or optimized code, amortizes the overhead of dynamic binary translation by reusing stored, altered copies of the original program instructions. Although conventional code cache management policies, such as Flush and Least-Recently Used (LRU), have been applied in classic dynamic binary translators, they are unsophisticated and unadaptable: they not only introduce unnecessary overhead but also waste cache space, since a software code cache exhibits several distinctive characteristics that set it apart from pages in memory. Consequently, this paper presents two novel alternative cache schemes based on the working set: SCC (Static Code Cache) and DCC (Dynamic Code Cache). In these schemes, we use the translation rate to identify the working set. To evaluate the new replacement policies, we implement them, together with several common code cache management policies, on the dynamic binary translator CrossBit. Experimental results on the SPECint 2000 benchmark show that our schemes achieve better performance and a higher cache space utilization ratio.
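The sketch below illustrates, under stated assumptions, the working-set intuition these schemes rely on: the translation rate (new translations per executed block) within a window is treated as a signal that the working set has changed, triggering a cache flush. The thresholds, the window size, and the flush-everything policy are simplifications rather than the exact SCC/DCC heuristics, and `translate` is a caller-supplied stand-in for the translator.

```python
from collections import OrderedDict

class WorkingSetCodeCache:
    """Code cache that flushes when the translation rate suggests a new working set."""
    def __init__(self, capacity_blocks=1024, rate_threshold=0.2, window=256):
        self.cache = OrderedDict()          # source PC -> translated block
        self.capacity = capacity_blocks
        self.window = window
        self.rate_threshold = rate_threshold
        self.executed = 0                   # blocks executed in the current window
        self.translated = 0                 # new translations in the current window

    def lookup(self, pc, translate):
        self.executed += 1
        block = self.cache.get(pc)
        if block is None:                   # miss: translate and insert
            self.translated += 1
            block = translate(pc)
            if len(self.cache) >= self.capacity:
                self._maybe_flush(force=True)
            self.cache[pc] = block
        if self.executed >= self.window:
            self._maybe_flush()
        return block

    def _maybe_flush(self, force=False):
        rate = self.translated / max(self.executed, 1)
        if force or rate > self.rate_threshold:   # working set likely changed
            self.cache.clear()
        self.executed = self.translated = 0

# Toy usage with a tiny cache and a dummy translator.
cache = WorkingSetCodeCache(capacity_blocks=4, window=8)
for pc in [0x100, 0x104, 0x100, 0x200, 0x204, 0x300, 0x304, 0x100]:
    cache.lookup(pc, translate=lambda p: f"tb_{p:#x}")
print(len(cache.cache))
```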
High-performance network interface cards (NICs) have become essential networking devices in commercial cloud computing environments. Therefore, efficient and scalable I/O virtualization is one of the primary challenges on virtualized cloud computing platforms. Single Root I/O Virtualization (SR-IOV) is a network interface technology that eliminates the overhead of redundant data copies and of virtual network switches through direct I/O, in order to achieve near-native I/O performance. However, SR-IOV still suffers from serious problems due to the high overhead of processing excessive network interrupts as well as the unpredictable and bursty traffic loads of high-speed networking connections. In this paper, the defects of SR-IOV with 10 Gigabit Ethernet networking are studied first, and two major challenges are identified: an excessive interrupt rate and a single-threaded virtual network driver. Second, two interrupt rate control optimization schemes, called coarse-grained interrupt rate (CGR) control and adaptive interrupt rate (AIR) control, are proposed. The proposed control schemes can significantly reduce the overhead and enhance SR-IOV performance compared with the traditional driver with a fixed interrupt throttle rate (FIR). In addition, a multi-threaded VF driver (MTVD) is proposed that allows SR-IOV VFs to leverage multi-core resources in order to achieve high scalability. Finally, these optimizations are implemented and detailed performance evaluations are conducted. The results show that CGR and AIR improve the throughput by 2.26× and 2.97× while saving 1.23 and 1.44 CPU cores, respectively. MTVD achieves 2.03× performance at the cost of an additional 1.46 cores for a VM using the SR-IOV driver.
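A minimal sketch of the adaptive idea behind AIR-style control follows: the interrupt rate budget is lowered under heavy traffic to cut interrupt-processing overhead and raised under light traffic to keep latency low. The rate bounds, traffic thresholds, and step size are illustrative assumptions, not the driver parameters used in the paper.

```python
def adapt_interrupt_rate(current_rate, pkts_per_sec,
                         min_rate=1_000, max_rate=20_000, step=1_000):
    """Return a new interrupt-per-second budget based on observed traffic."""
    if pkts_per_sec > 500_000:            # heavy, bursty load: coalesce harder
        return max(min_rate, current_rate - step)
    if pkts_per_sec < 50_000:             # light load: favor low latency
        return min(max_rate, current_rate + step)
    return current_rate                   # moderate load: keep the current rate

# Example feedback loop over sampled traffic loads (sample values are made up).
rate = 8_000
for load in [30_000, 120_000, 700_000, 650_000, 40_000]:
    rate = adapt_interrupt_rate(rate, load)
    print(load, "->", rate)
```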
Virtualization is a key technology in cloud computing; it can accommodate numerous guest VMs to provide transparent services, such as live migration, high availability, and rapid checkpointing. Cloud computing using virtualization allows workloads to be deployed and scaled quickly through the rapid provisioning of virtual machines on physical machines. However, I/O virtualization, particularly for networking, suffers from significant performance degradation in the presence of high-speed networking connections. In this paper, we first analyze performance challenges in network I/O virtualization and identify two problems: conventional network I/O virtualization suffers from excessive virtual interrupts to guest VMs, and the back-end driver does not efficiently use the computing resources of underlying multicore processors. To address these challenges, we propose optimization methods for enhancing the networking performance: 1) efficient interrupt coalescing for network I/O virtualization and 2) virtual receive-side scaling to effectively leverage multicore processors. These methods are implemented and evaluated with extensive performance tests on a Xen virtualization platform. Our experimental results confirm that the proposed optimizations can significantly improve network I/O virtualization performance and effectively solve the performance challenges.
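To illustrate the receive-side scaling idea mentioned above, the sketch below hashes each packet's flow 4-tuple to one of several per-vCPU receive queues, so processing spreads across cores while packets of a single flow stay ordered; the CRC32 hash and queue count are simplifying assumptions (real RSS implementations typically use a Toeplitz hash with an indirection table).

```python
import zlib

def select_queue(src_ip, src_port, dst_ip, dst_port, num_queues=4):
    """Map a flow 4-tuple to one of num_queues per-vCPU receive queues."""
    key = f"{src_ip}:{src_port}-{dst_ip}:{dst_port}".encode()
    return zlib.crc32(key) % num_queues

queues = [[] for _ in range(4)]
flows = [("10.0.0.1", 5001, "10.0.0.2", 80),
         ("10.0.0.3", 5002, "10.0.0.2", 80),
         ("10.0.0.4", 5003, "10.0.0.2", 443)]
for pkt_id, flow in enumerate(flows * 3):       # packets of one flow hit one queue
    queues[select_queue(*flow)].append(pkt_id)
print([len(q) for q in queues])
```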
Bayesian neural networks have been widely used in many applications because of their distinctive probabilistic representation framework. Even though Bayesian neural networks have been found to be more robust to adversarial attacks than vanilla neural networks, their ability to deal with adversarial noise in practice is still limited. In this paper, we propose Spectral Expectation Bound Regularization (SEBR) to enhance the robustness of Bayesian neural networks. Our theoretical analysis reveals that training with SEBR improves robustness to adversarial noise. We also prove that training with SEBR reduces the epistemic uncertainty of the model and hence makes the model more confident in its predictions, which verifies the robustness of the model from another point of view. Experiments on multiple Bayesian neural network structures and different adversarial attacks validate the correctness of the theoretical findings and the effectiveness of the proposed approach.
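The exact SEBR term is defined in the paper; the hedged sketch below only shows the general pattern of adding a spectral-norm-based penalty on the mean weights of a mean-field Bayesian layer to an ELBO-style loss, with the largest singular value estimated by power iteration. The penalty form, the lambda coefficient, and the toy numbers are illustrative assumptions.

```python
import numpy as np

def spectral_norm(w, iters=50, seed=0):
    """Estimate the largest singular value of w by power iteration."""
    rng = np.random.default_rng(seed)
    v = rng.normal(size=w.shape[1])
    for _ in range(iters):
        u = w @ v
        u /= np.linalg.norm(u) + 1e-12
        v = w.T @ u
        v /= np.linalg.norm(v) + 1e-12
    return float(u @ w @ v)

def regularized_loss(nll, kl, weight_means, lam=1e-3):
    # nll: negative log-likelihood; kl: KL(q || prior) of the variational posterior;
    # weight_means: mean weight matrices of the Bayesian layers.
    penalty = sum(spectral_norm(w) for w in weight_means)
    return nll + kl + lam * penalty

print(regularized_loss(1.7, 0.4, [np.eye(3) * 2.0]))   # toy numbers: 2.1 + 0.001 * 2
```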
With the rapid growth of video data, video summarization techniques play a key role in reducing the effort needed to explore video content by generating concise but informative summaries. Although supervised video summarization approaches have been well studied and achieve state-of-the-art performance, unsupervised methods are still highly demanded due to the intrinsic difficulty of obtaining high-quality annotations. In this paper, we propose a novel yet simple unsupervised video summarization method with attentive conditional Generative Adversarial Networks (GANs). Firstly, we build our framework upon Generative Adversarial Networks in an unsupervised manner. Specifically, the generator produces high-level weighted frame features and predicts frame-level importance scores, while the discriminator tries to distinguish between weighted frame features and raw frame features. Furthermore, we utilize a conditional feature selector to guide the GAN model to focus on the more important temporal regions of the whole video. Secondly, we are the first to introduce frame-level multi-head self-attention for video summarization, which learns long-range temporal dependencies along the whole video sequence and overcomes the local constraints of recurrent units, e.g., LSTMs. Extensive evaluations on two datasets, SumMe and TVSum, show that our proposed framework surpasses state-of-the-art unsupervised methods by a large margin and even outperforms most of the supervised methods. Additionally, we conduct an ablation study to reveal the influence of each component and parameter setting in our framework.
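As a small illustration of the frame-level multi-head self-attention highlighted above, the sketch below contextualizes a sequence of frame features with scaled dot-product attention split across heads; the random projection matrices, head count, and feature dimension are stand-ins for the learned parameters of the summarizer's generator.

```python
import numpy as np

def multi_head_self_attention(frames, num_heads=4, seed=0):
    """Frame-level multi-head self-attention over (num_frames, feature_dim) features."""
    t, d = frames.shape
    assert d % num_heads == 0
    dh = d // num_heads
    rng = np.random.default_rng(seed)
    wq, wk, wv = (rng.normal(scale=d ** -0.5, size=(d, d)) for _ in range(3))
    q, k, v = frames @ wq, frames @ wk, frames @ wv
    out = np.empty_like(frames)
    for h in range(num_heads):                           # attend within each head
        sl = slice(h * dh, (h + 1) * dh)
        scores = q[:, sl] @ k[:, sl].T / np.sqrt(dh)
        scores -= scores.max(axis=1, keepdims=True)      # numerical stability
        attn = np.exp(scores)
        attn /= attn.sum(axis=1, keepdims=True)
        out[:, sl] = attn @ v[:, sl]
    return out                                           # contextualized frame features

frames = np.random.default_rng(1).normal(size=(120, 64))  # 120 frames, dim 64
print(multi_head_self_attention(frames).shape)             # (120, 64)
```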
Federated Learning (FL) is popular for its privacy-preserving and collaborative learning capabilities. Recently, personalized FL (pFL) has received attention for its ability to address statistical heterogeneity and achieve personalization in FL. However, from the perspective of feature extraction, most existing pFL methods focus only on extracting global or personalized feature information during local training, which fails to meet the collaborative learning and personalization goals of pFL. To address this, we propose a new pFL method, named GPFL, which simultaneously learns global and personalized feature information on each client. We conduct extensive experiments on six datasets in three statistically heterogeneous settings and show the superiority of GPFL over ten state-of-the-art methods regarding effectiveness, scalability, fairness, stability, and privacy. Moreover, GPFL mitigates overfitting and outperforms the baselines by up to 8.99% in accuracy.
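A generic sketch of the "global plus personalized" feature split described above follows: each client combines a globally aggregated feature extractor with a locally kept personalized transform, and the server averages only the shared part. The matrix shapes, the concatenation, and the plain averaging step are illustrative assumptions, not GPFL's exact modules.

```python
import numpy as np

class Client:
    def __init__(self, dim_in, dim_feat, seed):
        rng = np.random.default_rng(seed)
        self.w_global = rng.normal(scale=0.1, size=(dim_in, dim_feat))    # synced with server
        self.w_personal = rng.normal(scale=0.1, size=(dim_in, dim_feat))  # stays local

    def features(self, x):
        g = np.tanh(x @ self.w_global)         # collaborative feature view
        p = np.tanh(x @ self.w_personal)       # personalized feature view
        return np.concatenate([g, p], axis=1)  # combined representation for the head

def aggregate(clients):
    """Server step: average only the shared extractor, leave personal parts untouched."""
    avg = np.mean([c.w_global for c in clients], axis=0)
    for c in clients:
        c.w_global = avg.copy()

clients = [Client(dim_in=16, dim_feat=8, seed=s) for s in range(3)]
aggregate(clients)
x = np.random.default_rng(0).normal(size=(4, 16))
print(clients[0].features(x).shape)            # (4, 16): global half + personal half
```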
With the increase in data volume and environment complexity, real-world problems require more advanced algorithms to acquire useful information for further analysis or decision making. Cognitive learning (CL) effectively handles incomplete information, and multiagent systems can provide enough data for analysis. Inspired by distributed machine learning, federated learning (FL) has become an efficient framework for implementing CL algorithms in multiagent systems while preserving user privacy. However, traditional communication optimizations for the FL framework suffer from either large communication volumes or large accuracy losses. In this article, we propose pFedEff, a personalized FL framework with efficient communication that reduces communication volume while preserving training accuracy. pFedEff uses two magnitude masks, two importance masks, and a personalized aggregation method to reduce the model and update size while maintaining training accuracy. Specifically, we use a pre-training magnitude mask for approximated regularization to reduce the magnitude of ineffective parameters during training, and a post-training magnitude mask to eliminate low-magnitude parameters after training. Furthermore, we use uploading and downloading importance masks to reduce the communication volume in both the upload and download streams. Our experimental results show that pFedEff reduces communication volume by up to 94% with only a 1% accuracy loss compared with other state-of-the-art FL algorithms. In addition, we conduct multiple ablation studies to evaluate the influence of the hyperparameters in pFedEff, which shows the flexibility of pFedEff and its applicability in different scenarios.
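As a hedged illustration of one ingredient, the sketch below applies a post-training magnitude mask that zeroes out low-magnitude entries of a client's model update before upload, so only a small fraction of parameters (plus their indices) needs to be communicated; the keep ratio and the top-k rule are illustrative, and pFedEff additionally uses a pre-training magnitude mask, importance masks, and personalized aggregation.

```python
import numpy as np

def magnitude_mask(update, keep_ratio=0.06):
    """Keep only the top keep_ratio fraction of entries by absolute magnitude."""
    flat = np.abs(update).ravel()
    k = max(1, int(keep_ratio * flat.size))
    threshold = np.partition(flat, -k)[-k]       # k-th largest magnitude
    mask = np.abs(update) >= threshold
    return update * mask, mask

rng = np.random.default_rng(0)
update = rng.normal(size=(64, 64))               # toy local model update
sparse_update, mask = magnitude_mask(update)
print(f"kept {mask.mean():.1%} of entries")      # roughly the 94% reduction idea
```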