Source author record

Mark Johnson

Mark Johnson appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language Computer Vision cond-mat.mtrl-sci Artificial Intelligence Biological Physics cond-mat.other cond-mat.soft cond-mat.str-el hep-ph Machine Learning math.AC math.AG physics.chem-ph physics.ins-det Quantitative Methods Tissues and Organs

Catalog footprint

What is connected

14works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

OpenMetBuoy-V2021: an easy-to-build, affordable, customizable, open source instrument for oceanographic measurements of drift and waves in sea ice and the open ocean

There is a wide consensus within the polar science, meteorology, and oceanography communities that more in-situ observations of the ocean, atmosphere, and sea ice, are required to further improve operational forecasting model skills. Traditionally, the volume of such measurements has been limited by the high cost of commercially available instruments. An increasingly attractive solution to this cost issue is to use instruments produced in-house from open source hardware, firmware, and post processing building blocks. In the present work, we release the next iteration of the open source drifter and waves monitoring instruments. The new design is both significantly less expensive, much easier to build and assemble for people without specific microelectronics and programming competence, more easily extendable and customizable, and two orders of magnitude more power efficient. Improving performance and reducing noise levels and costs compared with our previous generation of instruments is possible in large part thanks to progress from the electronics component industry. As a result, we believe that this will allow scientists in geosciences to increase by an order of magnitude the amount of in-situ data they can collect under a constant instrumentation budget. In the following, we offer 1) detailed overview of our hardware and software solution, 2) in-situ validation and benchmarking of our instrument, 3) full open source release of both hardware and software blueprints. We hope that this work, and the associated open source release, may be a milestone that will allow our scientific fields to transition towards open source, community driven instrumentation. We believe that this could have a considerable impact on many fields, by making in-situ instrumentation at least an order of magnitude less expensive and more customizable than it has been for the last 50 years.

preprint2021arXiv

ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning

Novel Object Captioning is a zero-shot Image Captioning task requiring describing objects not seen in the training captions, but for which information is available from external object detectors. The key challenge is to select and describe all salient detected novel objects in the input images. In this paper, we focus on this challenge and propose the ECOL-R model (Encouraging Copying of Object Labels with Reinforced Learning), a copy-augmented transformer model that is encouraged to accurately describe the novel object labels. This is achieved via a specialised reward function in the SCST reinforcement learning framework (Rennie et al., 2017) that encourages novel object mentions while maintaining the caption quality. We further restrict the SCST training to the images where detected objects are mentioned in reference captions to train the ECOL-R model. We additionally improve our copy mechanism via Abstract Labels, which transfer knowledge from known to novel object types, and a Morphological Selector, which determines the appropriate inflected forms of novel object labels. The resulting model sets new state-of-the-art on the nocaps (Agrawal et al., 2019) and held-out COCO (Hendricks et al., 2016) benchmarks.

preprint2020arXiv

Disfluency Detection using Auto-Correlational Neural Networks

In recent years, the natural language processing community has moved away from task-specific feature engineering, i.e., researchers discovering ad-hoc feature representations for various tasks, in favor of general-purpose methods that learn the input representation by themselves. However, state-of-the-art approaches to disfluency detection in spontaneous speech transcripts currently still depend on an array of hand-crafted features, and other representations derived from the output of pre-existing systems such as language models or dependency parsers. As an alternative, this paper proposes a simple yet effective model for automatic disfluency detection, called an auto-correlational neural network (ACNN). The model uses a convolutional neural network (CNN) and augments it with a new auto-correlation operator at the lowest layer that can capture the kinds of "rough copy" dependencies that are characteristic of repair disfluencies in speech. In experiments, the ACNN model outperforms the baseline CNN on a disfluency detection task with a 5% increase in f-score, which is close to the previous best result on this task.

preprint2020arXiv

Improving Disfluency Detection by Self-Training a Self-Attentive Model

Self-attentive neural syntactic parsers using contextualized word embeddings (e.g. ELMo or BERT) currently produce state-of-the-art results in joint parsing and disfluency detection in speech transcripts. Since the contextualized word embeddings are pre-trained on a large amount of unlabeled data, using additional unlabeled data to train a neural model might seem redundant. However, we show that self-training - a semi-supervised technique for incorporating unlabeled data - sets a new state-of-the-art for the self-attentive parser on disfluency detection, demonstrating that self-training provides benefits orthogonal to the pre-trained contextualized word representations. We also show that ensembling self-trained parsers provides further gains for disfluency detection.

preprint2020arXiv

Neural Constituency Parsing of Speech Transcripts

This paper studies the performance of a neural self-attentive parser on transcribed speech. Speech presents parsing challenges that do not appear in written text, such as the lack of punctuation and the presence of speech disfluencies (including filled pauses, repetitions, corrections, etc.). Disfluencies are especially problematic for conventional syntactic parsers, which typically fail to find any EDITED disfluency nodes at all. This motivated the development of special disfluency detection systems, and special mechanisms added to parsers specifically to handle disfluencies. However, we show here that neural parsers can find EDITED disfluency nodes, and the best neural parsers find them with an accuracy surpassing that of specialized disfluency detection systems, thus making these specialized mechanisms unnecessary. This paper also investigates a modified loss function that puts more weight on EDITED nodes. It also describes tree-transformations that simplify the disfluency detection task by providing alternative encodings of disfluencies and syntactic information.

preprint2019arXiv

nocaps: novel object captioning at scale

Image captioning models have achieved impressive results on datasets containing limited visual concepts and large amounts of paired image-caption training data. However, if these models are to ever function in the wild, a much larger variety of visual concepts must be learned, ideally from less supervision. To encourage the development of image captioning models that can learn visual concepts from alternative data sources, such as object detection datasets, we present the first large-scale benchmark for this task. Dubbed 'nocaps', for novel object captioning at scale, our benchmark consists of 166,100 human-generated captions describing 15,100 images from the OpenImages validation and test sets. The associated training data consists of COCO image-caption pairs, plus OpenImages image-level labels and object bounding boxes. Since OpenImages contains many more classes than COCO, nearly 400 object classes seen in test images have no or very few associated training captions (hence, nocaps). We extend existing novel object captioning models to establish strong baselines for this benchmark and provide analysis to guide future work on this task.

preprint2016arXiv

An empirical study for Vietnamese dependency parsing

This paper presents an empirical comparison of different dependency parsers for Vietnamese, which has some unusual characteristics such as copula drop and verb serialization. Experimental results show that the neural network-based parsers perform significantly better than the traditional parsers. We report the highest parsing scores published to date for Vietnamese with the labeled attachment score (LAS) at 73.53% and the unlabeled attachment score (UAS) at 80.66%.

preprint2016arXiv

N(N)LO event files: applications and prospects

In this contribution we comment on the use of NLO n-Tuples and consider their extension to NNLO. As an application of n-Tuples we present preliminary results of a strong coupling extraction from a measurement of the production of Z + 2, 3, 4 jets at the LHC at 7 TeV.

preprint2016arXiv

SPICE: Semantic Propositional Image Caption Evaluation

There is considerable interest in the task of automatically generating image captions. However, evaluation is challenging. Existing automatic evaluation metrics are primarily sensitive to n-gram overlap, which is neither necessary nor sufficient for the task of simulating human judgment. We hypothesize that semantic propositional content is an important component of human caption evaluation, and propose a new automated caption evaluation metric defined over scene graphs coined SPICE. Extensive evaluations across a range of models and datasets indicate that SPICE captures human judgments over model-generated captions better than other automatic metrics (e.g., system-level correlation of 0.88 with human judgments on the MS COCO dataset, versus 0.43 for CIDEr and 0.53 for METEOR). Furthermore, SPICE can answer questions such as `which caption-generator best understands colors?' and `can caption-generators count?'

preprint2014arXiv

Electronic and vibronic properties of a discotic liquid-crystal and its charge transfer complex

Discotic liquid crystalline (DLC) charge transfer (CT) complexes combine visible light absorption and rapid charge transfer characteristics within the CT complex, being favorable properties for photovoltaic (PV) applications. We present a detailed study of the electronic and vibrational properties of the prototypic 1:1 mixture of discotic 2,3,6,7,10,11-hexakishexyloxytriphenylene (HAT6) and 2,4,7-trinitro-9-fluorenone (TNF). It is shown that intermolecular charge transfer occurs in the groundstate of the complex: a charge delocalization of about 10-2 electron from the HAT6 core to TNF is deduced from both Raman and our previous NMR measurements (Reference 32), implying the presence of permanent dipoles at the donor-acceptor interface. A combined analysis of density functional theory calculations, resonant Raman and UV-VIS absorption measurements indicate that fast relaxation occurs in the UV region due to intramolecular vibronic coupling of HAT6 quinoidal modes with lower lying electronic states. Relatively slower relaxation in the visible region CT-band of the complex is also indicated, which likely involves motions of the TNF nitro groups. The fast quinoidal relaxation process in the hot UV band of HAT6 relates to pseudo-Jahn-Teller interactions in a single benzene unit, suggesting that the underlying vibronic coupling mechanism can be generic for polyaromatic hydrocarbons. Both the presence of CT dipoles and relatively slow relaxation processes in the CT band can be relevant concerning the design of DLC based organic PV systems.

preprint2014arXiv

High-throughput screening for modulators of cellular contractile force

When cellular contractile forces are central to pathophysiology, these forces comprise a logical target of therapy. Nevertheless, existing high-throughput screens are limited to upstream signaling intermediates with poorly defined relationship to such a physiological endpoint. Using cellular force as the target, here we screened libraries to identify novel drug candidates in the case of human airway smooth muscle cells in the context of asthma, and also in the case of Schlemm's canal endothelial cells in the context of glaucoma. This approach identified several drug candidates for both asthma and glaucoma. We attained rates of 1000 compounds per screening day, thus establishing a force-based cellular platform for high-throughput drug discovery.

preprint2014arXiv

Ubiquity of complete intersection liaison classes

In this paper, we provide constructions to enumerate large numbers of CI-liaison classes. To this end, we introduce a liaison invariant and prove several results concerning it, notably that it commutes with hypersurface sections. This theory is applied to the CI-liaison classes of ruled joins of projective schemes, yielding strong obstructions for such joins to lie in the same liaison class. A second construction arises from the actions of automorphisms on liaison classes, allowing the enumeration of many liaison classes of perfect ideals of codimension at least three.

preprint2009arXiv

Film Edge Nonlocal Spin Valves

Spintronics is a new paradigm for integrated digital electronics. Recently established as a niche for nonvolatile magnetic random access memory (MRAM), it offers new functionality while demonstrating low power and high speed performance. However, to reach high density spintronic technology must make a transition to the nanometer scale. Prototype devices are presently made using a planar geometry and have an area determined by the lithographic feature size, currently about 100 nm. Here we present a new nonplanar geometry in which one lateral dimension is given by a film thickness, the order of 10 nm. With this new approach, cell sizes can shrink by an order of magnitude. The geometry is demonstrated with a nonlocal spin valve, where we study devices with an injector/detector separation much less than the spin diffusion length.

preprint2006arXiv

Hierarchical geometric frustration in La3Cu2VO9

The crystallographic structure and magnetic properties of the La3Cu2VO9 were investigated by powder neutron diffraction and magnetization measurements. The compound materializes geometric frustration at two spatial scales, within clusters and between clusters, and at different temperature scales. It is shown by exactly solving the hamiltonian spectrum that collective spins are formed on each clusters at low temperature before inter-clusters coupling operates.

Mark Johnson

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

OpenMetBuoy-V2021: an easy-to-build, affordable, customizable, open source instrument for oceanographic measurements of drift and waves in sea ice and the open ocean

ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning

Disfluency Detection using Auto-Correlational Neural Networks

Improving Disfluency Detection by Self-Training a Self-Attentive Model

Neural Constituency Parsing of Speech Transcripts

nocaps: novel object captioning at scale

An empirical study for Vietnamese dependency parsing

N(N)LO event files: applications and prospects

SPICE: Semantic Propositional Image Caption Evaluation

Electronic and vibronic properties of a discotic liquid-crystal and its charge transfer complex

High-throughput screening for modulators of cellular contractile force

Ubiquity of complete intersection liaison classes

Film Edge Nonlocal Spin Valves

Hierarchical geometric frustration in La3Cu2VO9