Building Mental Models through Preview of Autopilot Behaviors

Autopilot behavior can help to ensure smooth human-vehicle collaboration during the initial exploration stage with the vehicle. The AutoPreview framework can provide a deeper understanding of autopilot behavior compared to direct interaction with the vehicle. We conducted a case study on human-vehicle collaboration and built a prototype of our framework with the CARLA simulator.…

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection

Training on datasets with long-tailed distributions has been challenging for major recognition tasks such as classification and detection. We show that image-level and object-level resamplings are both important, and thus unify them with a joint resampling strategy (RIO). Our method outperforms state-of-the-art long-tailed detection and segmentation methods on LVIS v0.5 across various backbones.…

MIPROT: A Medical Image Processing Toolbox for MATLAB

This paper presents a Matlab toolbox to perform basic image processing and visualization tasks, particularly designed for medical image processing. The toolbox is entirely written in native Matlab code, but is fast and flexible. Main use cases for the toolbox are illustrated here, including image input/output, pre-processing, filtering, image registration and visualisation.…

All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training

Pseudo-labeling is a key component in semi-supervised learning. SemCo leverages label semantics and co-training to address this problem. We train two classifiers with two different views of the class labels. We show that our method achieves state-of-the-art performance across various SSL tasks, including a 5.6% accuracy improvement on the Mini-ImageNet dataset with 1000 labeled examples.…
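
As a rough illustration of what pseudo-labeling means in general, the sketch below assigns an artificial label to an unlabeled example only when the classifier's top probability clears a confidence threshold. This is the generic thresholded variant, not SemCo's label-grouping or co-training procedure; the function name `pseudo_labels` and the default threshold of 0.95 are assumptions made for this example.

```python
from typing import List, Optional, Sequence


def pseudo_labels(
    probs: Sequence[Sequence[float]], threshold: float = 0.95
) -> List[Optional[int]]:
    """Generic confidence-thresholded pseudo-labeling (illustrative only).

    probs[i] holds the predicted class probabilities for unlabeled example i.
    An example receives its arg-max class as a pseudo-label when that
    probability is at least `threshold` (a hypothetical default); otherwise
    it stays unlabeled (None).
    """
    labels: List[Optional[int]] = []
    for p in probs:
        best = max(range(len(p)), key=lambda k: p[k])
        labels.append(best if p[best] >= threshold else None)
    return labels


if __name__ == "__main__":
    predictions = [[0.97, 0.02, 0.01], [0.50, 0.30, 0.20]]
    print(pseudo_labels(predictions))
```

Only the confident first example is labeled here; the second is left out of the pseudo-labeled pool.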

Speaking of Trust: Speech as a Measure of Trust

We propose to use speech cues (on what, when and how the user talks) as an objective real-time measure of trust. This could be implemented in the robot to calibrate towards appropriate trust. However, we would like to open the discussion on how to deal with the ethical implications of this trust measure.…

What We Measure in Mixed Reality Experiments

There are many potential measures that one might use when evaluating mixed reality experiences. In this position paper I will argue that there are various stances to take for evaluation, depending on the framing of the work. I will sketch out some directions for developing more robust measures that can help the field move forward.…

StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer

Text style transfer aims to controllably generate text with targeted stylistic changes while keeping the core meaning of the source sentence constant. Many existing style transfer benchmarks do not offer fine-grained control over sentence structure, emphasis, and content. We introduce a large-scale benchmark, StylePTB, with paired sentences undergoing 21 stylistic changes spanning atomic lexical, syntactic, semantic, and thematic transfers of text.…

Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection

Glance and Gaze Network (GGNet) adaptively models a set of action-aware points (ActPoints) via glance and gaze steps. GGNet outperforms state-of-the-art methods by significant margins on both the V-COCO and HICO-DET benchmarks. Code is available on GitHub at https://github.com/SherlockHolmes221/GGNet.…

WLFC: Write Less in Flash-based Cache

Flash-based disk caches, for example Bcache and Flashcache, have gained tremendous popularity in industry in the last decade. But these cache systems have worse write performance than read performance because of the asymmetric I/O costs and the internal GC mechanism.…

Adversarial Open Domain Adaption for Sketch-to-Photo Synthesis

Open-domain sketch-to-photo translation is challenging due to the lack of training supervision and the large geometry distortion between the freehand sketch and photo domains. We propose a framework that jointly learns sketch-to-photo and photo-to-sketch generation. We validate our method on the Scribble and SketchyCOCO datasets.…

Feedback Vertex Set on Hamiltonian Graphs

We study the computational complexity of Feedback Vertex Set on subclasses of Hamiltonian graphs. In particular, we consider Hamiltonian graphs that are regular or are planar and regular. We also study the less known class of $p$-Hamiltonian-ordered graphs, which admit for any $p$-tuple of vertices a Hamiltonian cycle visiting them in the order given by the tuple.…
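
For readers unfamiliar with the problem: a feedback vertex set is a set of vertices whose removal leaves the graph without cycles. The sketch below only verifies a candidate set on a simple undirected graph using union-find; it is not an algorithm from the paper, and the function name and the edge-list representation are assumptions made for this example.

```python
from typing import Iterable, List, Set, Tuple


def is_feedback_vertex_set(
    n: int, edges: Iterable[Tuple[int, int]], removed: Set[int]
) -> bool:
    """Check whether deleting `removed` from a simple undirected graph
    on vertices 0..n-1 leaves a forest (a graph with no cycles)."""
    parent: List[int] = list(range(n))

    def find(x: int) -> int:
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        if u in removed or v in removed:
            continue  # the edge disappears together with its endpoint
        ru, rv = find(u), find(v)
        if ru == rv:
            return False  # u and v are already connected: this edge closes a cycle
        parent[ru] = rv
    return True


if __name__ == "__main__":
    # K4 is Hamiltonian and 3-regular; one vertex is not enough, two are.
    k4 = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3)]
    print(is_feedback_vertex_set(4, k4, {0}))
    print(is_feedback_vertex_set(4, k4, {0, 1}))
```

Verification is easy, as the linear pass above shows; the hardness question studied in the paper concerns finding a smallest such set.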

Approach for modeling single branches of meadow orchard trees with 3D point clouds

The goal of this research is to create a tree model to automatically determine possible pruning points for stand-alone trees within meadows. The algorithm is capable of building a skeleton model based on a pre-segmented photogrammetric 3D point cloud. Good results were achieved in assigning the points to their leading branches and building a virtual tree model, reaching an overall accuracy of 95.19%.…

On Unifying Misinformation Detection

UnifiedM2 is a general-purpose misinformation model that jointly models multiple domains of misinformation with a single, unified setup. The model is trained to handle four tasks: detecting news bias, clickbait, fake news, and verifying rumors. By grouping these tasks together, the model learns a richer representation of misinformation, which leads to comparable performance across all tasks.…

Characterization of Decomposition of Matrix Multiplication Tensors

The canonical polyadic (CP) decomposition of tensors that correspond to matrix multiplications is studied. Finding the rank of these tensors and computing the decompositions is a fundamental problem of algebraic complexity theory. In this paper, we present a novel rank-40 decomposition for the tensor multiplication of matrices of size 3×3 with 3×6.…

Quotients of Bounded Natural Functors

The functorial structure of type constructors is the foundation for many definition and proof principles in higher-order logic (HOL). In this article, we tackle the preservation question for quotients, the last principle for introducing new types in HOL. Surprisingly, lifting the structure in the obvious manner fails for some quotients.…

Intra-Class Uncertainty Loss Function for Classification

In our framework, the features extracted by deep networks for each class are characterized by independent Gaussian distributions. The means of the Gaussians play a similar role to the center anchor in existing methods. In addition, we introduce a margin on intra-class uncertainty to make each cluster more compact and reduce the imbalance of feature distribution across categories.…

Machine Translation Decoding beyond Beam Search

Beam search is the go-to method for decoding auto-regressive machine translation models. While it yields consistent improvements in terms of BLEU, it is only concerned with finding outputs with high model likelihood. Our aim is to establish whether beam search can be replaced by a more powerful metric-driven search technique.…
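
To make concrete what "only concerned with high model likelihood" means, here is a minimal sketch of standard beam search over a toy next-token distribution. It is the baseline being discussed, not the metric-driven alternative the paper investigates. The names `beam_search` and `toy_step`, the end-of-sequence token `</s>`, and the toy probabilities are assumptions made for this example.

```python
import math
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[Tuple[str, ...], float]  # (tokens, summed log-probability)


def beam_search(
    step: Callable[[Tuple[str, ...]], Dict[str, float]],
    beam_size: int,
    max_len: int,
    eos: str = "</s>",
) -> List[Hypothesis]:
    """Keep the `beam_size` most likely partial outputs at each step.

    `step(prefix)` returns next-token probabilities given the prefix.
    Hypotheses are ranked purely by model log-likelihood.
    """
    beams: List[Hypothesis] = [((), 0.0)]
    finished: List[Hypothesis] = []
    for _ in range(max_len):
        candidates: List[Hypothesis] = []
        for prefix, score in beams:
            for token, prob in step(prefix).items():
                if prob > 0.0:  # skip impossible continuations (log undefined)
                    candidates.append((prefix + (token,), score + math.log(prob)))
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for hyp in candidates[:beam_size]:
            # Completed hypotheses leave the beam; the rest are expanded further.
            (finished if hyp[0][-1] == eos else beams).append(hyp)
        if not beams:
            break
    finished.extend(beams)  # hypotheses cut off at max_len
    return sorted(finished, key=lambda c: c[1], reverse=True)


def toy_step(prefix: Tuple[str, ...]) -> Dict[str, float]:
    """Hypothetical toy model where the greedy first choice is suboptimal."""
    if not prefix:
        return {"a": 0.6, "b": 0.4}
    if len(prefix) == 1:
        return {"c": 0.55, "d": 0.45} if prefix[0] == "a" else {"c": 0.9, "d": 0.1}
    return {"</s>": 1.0}


if __name__ == "__main__":
    print(beam_search(toy_step, beam_size=1, max_len=5)[0])  # greedy decoding
    print(beam_search(toy_step, beam_size=2, max_len=5)[0])  # wider beam
```

With a beam of one (greedy decoding) the search commits to "a" and ends with probability 0.33, while a beam of two recovers the more likely sequence starting with "b" (probability 0.36). In both cases the ranking criterion is likelihood alone, which is the limitation the abstract points at.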

TermAdventure: Interactively Teaching UNIX Command Line, Text Adventure Style

TermAdventure (TA) is a suite of tools for creating interactive UNIX exercises. These resemble text adventure games, which immerse the user in a text environment and let them interact with it using textual commands. The suite is released under an open source license, has minimal dependencies and can be used either on a UNIX-style server or a desktop computer running any major OS platform through Docker.…

Information Rate Optimization for Joint Relay and Link in Non-Regenerative MIMO Channels

The optimization of the Relay Transform Matrix (RTM) in a two-hop relay network with an average relay power constraint and perfect channel state information at the relay is addressed in this paper. The study considers the most general case in terms of number of transmit and receive antennas at the source, relay, and destination, with arbitrary correlation of the received noise at all terminals.…

Action-Conditioned 3D Human Motion Synthesis with Transformer VAE

We tackle the problem of action-conditioned generation of realistic and diverse human motion sequences. In contrast to methods that complete, or extend, motion sequences, this task does not require an initial pose or sequence. We learn an action-aware latent representation for human motions by training a generative variational autoencoder (VAE). We evaluate our approach on the NTU RGB+D, HumanAct12 and UESTC datasets and show improvements over the state of the art.…

Secure Cognitive Radio Communication via Intelligent Reflecting Surface

In this paper, an intelligent reflecting surface (IRS) assisted spectrum sharing underlay cognitive radio (CR) wiretap channel (WTC) is studied. We aim at enhancing the secrecy rate of the secondary user in this channel subject to a total power constraint at the secondary transmitter (ST), an interference power constraint (IPC) at the primary receiver (PR) and a unit modulus constraint at the IRS.…

A Hybrid Parallelization Approach for Distributed and Scalable Deep Learning

Deep Neural Networks (DNNs) have recorded great success in handling medical and other complex classification tasks. As the sizes of a DNN model and the available dataset increase, the training process becomes more computationally intensive. We propose a generic, full end-to-end hybrid parallelization approach combining both model and data parallelism for efficient distributed and scalable training of DNN models.…