Results 281 to 290 of about 3,637,319 (313)
Some of the next articles are maybe not open access.
RASSP Benchmark 2 Technical Description.
1995Abstract : This report describes the second in a series of application problems which are intended to measure the performance of a process for rapid prototyping of embedded digital signal processors. The rapid prototyping process is being developed for the ARPA/Tri-Services Rapid Prototyping of Application Specific Signal Processors (RASSP) program ...
Allan H. Anderson +4 more
openaire +1 more source
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
arXiv.orgThe recent breakthroughs in OpenAI's GPT4o model have demonstrated surprisingly good capabilities in image generation and editing, resulting in significant excitement in the community.
Zhiyuan Yan +9 more
semanticscholar +1 more source
A complete diploid human genome benchmark for personalized genomics
bioRxivHuman genome resequencing typically involves mapping reads to a reference genome to call variants; however, this approach suffers from both technical and reference biases, leaving many duplicated and structurally polymorphic regions of the genome ...
Nancy F. Hansen +64 more
semanticscholar +1 more source
Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study
IEEE Transactions on Image Processing, 2019Existing enhancement methods are empirically expected to help the high-level end computer vision task: however, that is observed to not always be the case in practice.
Wenhan Yang +67 more
semanticscholar +1 more source
Benchmarking the Delivery of Technical Support
Research-Technology Management, 1993Technology must serve two masters. First, it must serve the forward end of the business, assuring advance in products and processes for now and the future. Equally important, it must serve the operating units with their innumerable technical support requirements--from manufacturing to engineering to marketing to customer support.
openaire +1 more source
European Conference on Computer Vision
For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step.
Raghav Kapoor +6 more
semanticscholar +1 more source
For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step.
Raghav Kapoor +6 more
semanticscholar +1 more source
Journal of Computing and Information Science in Engineering
This research introduces DesignQA, a novel benchmark aimed at evaluating the proficiency of multimodal large language models (MLLMs) in comprehending and applying engineering requirements in technical documentation. Developed with a focus on real-world
Anna C. Doris +5 more
semanticscholar +1 more source
This research introduces DesignQA, a novel benchmark aimed at evaluating the proficiency of multimodal large language models (MLLMs) in comprehending and applying engineering requirements in technical documentation. Developed with a focus on real-world
Anna C. Doris +5 more
semanticscholar +1 more source
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
arXiv.orgWe present Step-Video-T2V, a state-of-the-art text-to-video pre-trained model with 30B parameters and the ability to generate videos up to 204 frames in length. A deep compression Variational Autoencoder, Video-VAE, is designed for video generation tasks,
Guoqing Ma +99 more
semanticscholar +1 more source
Implementation and Numerical Techniques for One EFlop/s HPL-AI Benchmark on Fugaku
ACM SIGPLAN Symposium on Scala, 2020Our performance benchmark of HPL-AI on the supercomputer Fugaku was awarded the 55th Top500. The effective performance was 1.42 EFlop/s, and the world's first achievement to exceed the wall of Exa-scale in a floating-point arithmetic benchmark.
Shuhei Kudo +3 more
semanticscholar +1 more source
arXiv.org
In this report, we introduce Ovis-U1, a 3-billion-parameter unified model that integrates multimodal understanding, text-to-image generation, and image editing capabilities.
G. Wang +11 more
semanticscholar +1 more source
In this report, we introduce Ovis-U1, a 3-billion-parameter unified model that integrates multimodal understanding, text-to-image generation, and image editing capabilities.
G. Wang +11 more
semanticscholar +1 more source

