LIVIVO - The Search Portal for Life Sciences

zur deutschen Oberfläche wechseln
Advanced search

Search results

Result 1 - 10 of total 31

Search options

  1. Article ; Online: JsonCurer: Data Quality Management for JSON Based on an Aggregated Schema.

    Xiong, Kai / Xu, Xinyi / Fu, Siwei / Weng, Di / Wang, Yongheng / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2024  Volume PP

    Abstract: High-quality data is critical to deriving useful and reliable information. However, real-world data often contains quality issues undermining the value of the derived information. Most existing research on data quality management focuses on tabular data, ...

    Abstract High-quality data is critical to deriving useful and reliable information. However, real-world data often contains quality issues undermining the value of the derived information. Most existing research on data quality management focuses on tabular data, leaving semi-structured data under-exploited. Due to the schema-less and hierarchical features of semi-structured data, discovering and fixing quality issues is challenging and time-consuming. To address the challenge, this paper presents JsonCurer, an interactive visualization system to assist with data quality management in the context of JSON data. To have an overview of quality issues, we first construct a taxonomy based on interviews with data practitioners and a review of 119 real-world JSON files. Then we highlight a schema visualization that presents structural information, statistical features, and quality issues of JSON data. Based on a similarity-based aggregation technique, the visualization depicts the entire JSON data with a concise tree, where summary visualizations are given above each node, and quality issues are illustrated using Bubble Sets across nodes. We evaluate the effectiveness and usability of JsonCurer with two case studies. One is in the domain of data analysis while the other concerns quality assurance in MongoDB documents. The source code of JsonCurer is available under the Apache License 2.0 at https://github.com/changevis/JsonCurer.
    Language English
    Publishing date 2024-04-16
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2024.3388556
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  2. Article ; Online: Relation-driven Query of Multiple Time Series.

    Liu, Shuhan / Tian, Yuan / Deng, Zikun / Cui, Weiwei / Zhang, Haidong / Weng, Di / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2024  Volume PP

    Abstract: Querying time series based on their relations is a crucial part of multiple time series analysis. By retrieving and understanding time series relations, analysts can easily detect anomalies and validate hypotheses in complex time series datasets. However, ...

    Abstract Querying time series based on their relations is a crucial part of multiple time series analysis. By retrieving and understanding time series relations, analysts can easily detect anomalies and validate hypotheses in complex time series datasets. However, current relation extraction approaches, including knowledge- and data-driven ones, tend to be laborious and do not support heterogeneous relations. By conducting a formative study with 11 experts, we concluded six time series relations, including correlation, causality, similarity, lag, arithmetic, and meta, and summarized three pain points in querying time series involving these relations. We proposed RelaQ, an interactive system that supports the time series query via relation specifications. RelaQ allows users to intuitively specify heterogeneous relations when querying multiple time series, understand the query results based on a scalable, multi-level visualization, and explore possible relations beyond the existing queries. RelaQ is evaluated with two cases and a user study with 12 participants, showing promising effectiveness and usability.
    Language English
    Publishing date 2024-05-07
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2024.3397554
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  3. Article ; Online: Interactive Table Synthesis with Natural Language.

    Huang, Yanwei / Zhou, Yunfan / Chen, Ran / Pan, Changhao / Shu, Xinhuan / Weng, Di / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2023  Volume PP

    Abstract: Tables are a ubiquitous data format for insight communication. However, transforming data into consumable tabular views remains a challenging and time-consuming task. To lower the barrier of such a task, research efforts have been devoted to developing ... ...

    Abstract Tables are a ubiquitous data format for insight communication. However, transforming data into consumable tabular views remains a challenging and time-consuming task. To lower the barrier of such a task, research efforts have been devoted to developing interactive approaches for data transformation, but many approaches still presume that their users have considerable knowledge of various data transformation concepts and functions. In this study, we leverage natural language (NL) as the primary interaction modality to improve the accessibility of average users to performing complex data transformation and facilitate intuitive table generation and editing. Designing an NL-driven data transformation approach introduces two challenges: a) NL-driven synthesis of interpretable pipelines and b) incremental refinement of synthesized tables. To address these challenges, we present NL2Rigel, an interactive tool that assists users in synthesizing and improving tables from semi-structured text with NL instructions. Based on a large language model and prompting techniques, NL2Rigel can interpret the given NL instructions into a table synthesis pipeline corresponding to Rigel specifications, a declarative language for tabular data transformation. An intuitive interface is designed to visualize the synthesis pipeline and the generated tables, helping users understand the transformation process and refine the results efficiently with targeted NL instructions. The comprehensiveness of NL2Rigel is demonstrated with an example gallery, and we further confirmed NL2Rigel's usability with a comparative user study by showing that the task completion time with NL2Rigel is significantly shorter than that with the original version of Rigel with comparable completion rates.
    Language English
    Publishing date 2023-11-01
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2023.3329120
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  4. Book ; Online: Positive emotions help rank negative reviews in e-commerce

    Weng, Di / Zhao, Jichang

    2020  

    Abstract: Negative reviews, the poor ratings in postpurchase evaluation, play an indispensable role in e-commerce, especially in shaping future sales and firm equities. However, extant studies seldom examine their potential value for sellers and producers in ... ...

    Abstract Negative reviews, the poor ratings in postpurchase evaluation, play an indispensable role in e-commerce, especially in shaping future sales and firm equities. However, extant studies seldom examine their potential value for sellers and producers in enhancing capabilities of providing better services and products. For those who exploited the helpfulness of reviews in the view of e-commerce keepers, the ranking approaches were developed for customers instead. To fill this gap, in terms of combining description texts and emotion polarities, the aim of the ranking method in this study is to provide the most helpful negative reviews under a certain product attribute for online sellers and producers. By applying a more reasonable evaluating procedure, experts with related backgrounds are hired to vote for the ranking approaches. Our ranking method turns out to be more reliable for ranking negative reviews for sellers and producers, demonstrating a better performance than the baselines like BM25 with a result of 8% higher. In this paper, we also enrich the previous understandings of emotions in valuing reviews. Specifically, it is surprisingly found that positive emotions are more helpful rather than negative emotions in ranking negative reviews. The unexpected strengthening from positive emotions in ranking suggests that less polarized reviews on negative experience in fact offer more rational feedbacks and thus more helpfulness to the sellers and producers. The presented ranking method could provide e-commerce practitioners with an efficient and effective way to leverage negative reviews from online consumers.

    Comment: Emotion lexicons are publicly available at https://doi.org/10.6084/m9.figshare.12327680.v1
    Keywords Computer Science - Computation and Language ; Computer Science - Computers and Society
    Subject code 001
    Publishing date 2020-05-19
    Publishing country us
    Document type Book ; Online
    Database BASE - Bielefeld Academic Search Engine (life sciences selection)

    More links

    Kategorien

  5. Article ; Online: A survey of urban visual analytics: Advances and future directions.

    Deng, Zikun / Weng, Di / Liu, Shuhan / Tian, Yuan / Xu, Mingliang / Wu, Yingcai

    Computational visual media

    2022  Volume 9, Issue 1, Page(s) 3–39

    Abstract: Developing effective visual analytics systems demands care in characterization of domain problems and integration of visualization techniques and computational models. Urban visual analytics has already achieved remarkable success in tackling urban ... ...

    Abstract Developing effective visual analytics systems demands care in characterization of domain problems and integration of visualization techniques and computational models. Urban visual analytics has already achieved remarkable success in tackling urban problems and providing fundamental services for smart cities. To promote further academic research and assist the development of industrial urban analytics systems, we comprehensively review urban visual analytics studies from four perspectives. In particular, we identify 8 urban domains and 22 types of popular visualization, analyze 7 types of computational method, and categorize existing systems into 4 types based on their integration of visualization techniques and computational models. We conclude with potential research directions and opportunities.
    Language English
    Publishing date 2022-10-18
    Publishing country China
    Document type Journal Article ; Review
    ZDB-ID 2844021-3
    ISSN 2096-0662 ; 2096-0433
    ISSN (online) 2096-0662
    ISSN 2096-0433
    DOI 10.1007/s41095-022-0275-7
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  6. Article ; Online: Rigel: Transforming Tabular Data by Declarative Mapping.

    Chen, Ran / Weng, Di / Huang, Yanwei / Shu, Xinhuan / Zhou, Jiayi / Sun, Guodao / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2022  Volume 29, Issue 1, Page(s) 128–138

    Abstract: We present Rigel, an interactive system for rapid transformation of tabular data. Rigel implements a new declarative mapping approach that formulates the data transformation procedure as direct mappings from data to the row, column, and cell channels of ... ...

    Abstract We present Rigel, an interactive system for rapid transformation of tabular data. Rigel implements a new declarative mapping approach that formulates the data transformation procedure as direct mappings from data to the row, column, and cell channels of the target table. To construct such mappings, Rigel allows users to directly drag data attributes from input data to these three channels and indirectly drag or type data values in a spreadsheet, and possible mappings that do not contradict these interactions are recommended to achieve efficient and straightforward data transformation. The recommended mappings are generated by enumerating and composing data variables based on the row, column, and cell channels, thereby revealing the possibility of alternative tabular forms and facilitating open-ended exploration in many data transformation scenarios, such as designing tables for presentation. In contrast to existing systems that transform data by composing operations (like transposing and pivoting), Rigel requires less prior knowledge on these operations, and constructing tables from the channels is more efficient and results in less ambiguity than generating operation sequences as done by the traditional by-example approaches. User study results demonstrated that Rigel is significantly less demanding in terms of time and interactions and suits more scenarios compared to the state-of-the-art by-example approach. A gallery of diverse transformation cases is also presented to show the potential of Rigel's expressiveness.
    Language English
    Publishing date 2022-12-16
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2022.3209385
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  7. Article ; Online: Multilevel Visual Analysis of Aggregate Geo-Networks.

    Deng, Zikun / Chen, Shifu / Xie, Xiao / Sun, Guodao / Xu, Mingliang / Weng, Di / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2022  Volume PP

    Abstract: Numerous patterns found in urban phenomena, such as air pollution and human mobility, can be characterized as many directed geospatial networks (geo-networks) that represent spreading processes in urban space. These geo-networks can be analyzed from ... ...

    Abstract Numerous patterns found in urban phenomena, such as air pollution and human mobility, can be characterized as many directed geospatial networks (geo-networks) that represent spreading processes in urban space. These geo-networks can be analyzed from multiple levels, ranging from the macro-level of summarizing all geo-networks, meso-level of comparing or summarizing parts of geo-networks, and micro-level of inspecting individual geo-networks. Most of the existing visualizations cannot support multilevel analysis well. These techniques work by: 1) showing geo-networks separately with multiple maps leads to heavy context switching costs between different maps; 2) summarizing all geo-networks into a single network can lead to the loss of individual information; 3) drawing all geo-networks onto one map might suffer from the visual scalability issue in distinguishing individual geo-networks. In this study, we propose GeoNetverse, a novel visualization technique for analyzing aggregate geo-networks from multiple levels. Inspired by metro maps, GeoNetverse balances the overview and details of the geo-networks by placing the edges shared between geo-networks in a stacked manner. To enhance the visual scalability, GeoNetverse incorporates a level-of-detail rendering, a progressive crossing minimization, and a coloring technique. A set of evaluations was conducted to evaluate GeoNetverse from multiple perspectives.
    Language English
    Publishing date 2022-12-19
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2022.3229953
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  8. Article ; Online: Nebula: A Coordinating Grammar of Graphics.

    Chen, Ran / Shu, Xinhuan / Chen, Jiahui / Weng, Di / Tang, Junxiu / Fu, Siwei / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2022  Volume 28, Issue 12, Page(s) 4127–4140

    Abstract: In multiple coordinated views (MCVs), visualizations across views update their content in response to users' interactions in other views. Interactive systems provide direct manipulation to create coordination between views, but are restricted to limited ... ...

    Abstract In multiple coordinated views (MCVs), visualizations across views update their content in response to users' interactions in other views. Interactive systems provide direct manipulation to create coordination between views, but are restricted to limited types of predefined templates. By contrast, textual specification languages enable flexible coordination but expose technical burden. To bridge the gap, we contribute Nebula, a grammar based on natural language for coordinating visualizations in MCVs. The grammar design is informed by a novel framework based on a systematic review of 176 coordinations from existing theories and applications, which describes coordination by demonstration, i.e., how coordination is performed by users. With the framework, Nebula specification formalizes coordination as a composition of user- and coordination-triggered interactions in origin and destination views, respectively, along with potential data transformation between the interactions. We evaluate Nebula by demonstrating its expressiveness with a gallery of diverse examples and analyzing its usability on cognitive dimensions.
    Language English
    Publishing date 2022-10-26
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2021.3076222
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  9. Article ; Online: Visualizing Large-Scale Spatial Time Series with GeoChron.

    Deng, Zikun / Chen, Shifu / Schreck, Tobias / Deng, Dazhen / Tang, Tan / Xu, Mingliang / Weng, Di / Wu, Yingcai

    IEEE transactions on visualization and computer graphics

    2023  Volume 30, Issue 1, Page(s) 1194–1204

    Abstract: In geo-related fields such as urban informatics, atmospheric science, and geography, large-scale spatial time (ST) series (i.e., geo-referred time series) are collected for monitoring and understanding important spatiotemporal phenomena. ST series ... ...

    Abstract In geo-related fields such as urban informatics, atmospheric science, and geography, large-scale spatial time (ST) series (i.e., geo-referred time series) are collected for monitoring and understanding important spatiotemporal phenomena. ST series visualization is an effective means of understanding the data and reviewing spatiotemporal phenomena, which is a prerequisite for in-depth data analysis. However, visualizing these series is challenging due to their large scales, inherent dynamics, and spatiotemporal nature. In this study, we introduce the notion of patterns of evolution in ST series. Each evolution pattern is characterized by 1) a set of ST series that are close in space and 2) a time period when the trends of these ST series are correlated. We then leverage Storyline techniques by considering an analogy between evolution patterns and sessions, and finally design a novel visualization called GeoChron, which is capable of visualizing large-scale ST series in an evolution pattern-aware and narrative-preserving manner. GeoChron includes a mining framework to extract evolution patterns and two-level visualizations to enhance its visual scalability. We evaluate GeoChron with two case studies, an informal user study, an ablation study, parameter analysis, and running time analysis.
    Language English
    Publishing date 2023-12-25
    Publishing country United States
    Document type Journal Article
    ISSN 1941-0506
    ISSN (online) 1941-0506
    DOI 10.1109/TVCG.2023.3327162
    Database MEDical Literature Analysis and Retrieval System OnLINE

    More links

    Kategorien

  10. Article: Effect of particle size on the physicochemical and antioxidant properties of Forsythia suspensa (Thunb.)Vahl leaf powders

    Weng, Di / Zha, Sheng-Hua / Zhu, Yuan / Li, Hang / Hou, Shou-Bu / Zhao, Qing-Sheng / Zhao, Bing

    Powder technology. 2022 Sept., v. 410

    2022  

    Abstract: Four powders of Forsythia suspensa leaf were prepared by coarse grinding and subsequent ball milling. The effects of particle size on the physicochemical and antioxidant properties of the powders were evaluated. The results showed that the particle size ... ...

    Abstract Four powders of Forsythia suspensa leaf were prepared by coarse grinding and subsequent ball milling. The effects of particle size on the physicochemical and antioxidant properties of the powders were evaluated. The results showed that the particle size of the powder decreased from 107.99 to 29.20 μm by ball milling for 5 h. In addition, the ball-milled powders had substantial advantages in water holding, solubility and swelling capacities. Thus, the ball-milled powders had a better dissolution of phenolic, flavonoid, and polysaccharide components (25.45, 1.99 and 44.19 mg/g of powder for 5 h of ball milling), ultimately resulting in a superior antioxidant activity (maximum reductions of 14.96% and 19.20% for DPPH and ABTS radicals, respectively). The FTIR analysis showed that the structure of the active ingredients remained intact after ball milling. In conclusion, ball milling is a promising technique for preparing Forsythia suspensa leaf powder with improved physicochemical and biological properties.
    Keywords Forsythia suspensa ; antioxidant activity ; antioxidants ; flavonoids ; leaves ; particle size ; polysaccharides ; solubility ; technology
    Language English
    Dates of publication 2022-09
    Publishing place Elsevier B.V.
    Document type Article
    ISSN 0032-5910
    DOI 10.1016/j.powtec.2022.117866
    Database NAL-Catalogue (AGRICOLA)

    More links

    Kategorien

To top