<?xml version="1.0" encoding="US-ASCII"?>
<!DOCTYPE rfc SYSTEM "rfc2629.dtd">
<?rfc toc="yes"?>
<?rfc tocompact="yes"?>
<?rfc tocdepth="3"?>
<?rfc tocindent="yes"?>
<?rfc symrefs="yes"?>
<?rfc sortrefs="yes"?>
<?rfc comments="yes"?>
<?rfc inline="yes"?>
<?rfc compact="yes"?>
<?rfc subcompact="no"?>
<rfc category="info" docName="draft-du-tsvwg-odes-problem-statement-01"
     ipr="trust200902">
  <front>
    <title abbrev="DES Problem Statement">Use Cases and Problem Statements of
    Online Data Express Service</title>

    <author fullname="Zongpeng Du" initials="Z." surname="Du">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street>No.32 XuanWuMen West Street</street>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>duzongpeng@foxmail.com</email>
      </address>
    </author>

    <author fullname="Kehan Yao" initials="K." surname=" Yao">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street>No.32 XuanWuMen West Street</street>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>yaokehan@chinamobile.com</email>
      </address>
    </author>

    <author fullname="Hongwei Yang" initials="H." surname="Yang">
      <organization>China Mobile</organization>

      <address>
        <postal>
          <street>No.32 XuanWuMen West Street</street>

          <city>Beijing</city>

          <code>100053</code>

          <country>China</country>
        </postal>

        <email>yanghongwei@chinamobile.com</email>
      </address>
    </author>

    <date month="" year=""/>

    <area>Internet Area</area>

    <workgroup>TSVWG</workgroup>

    <keyword>online data express service</keyword>

    <abstract>
      <t>This document describes use cases and problem statements of Online
      Data Express Service.</t>
    </abstract>

    <note title="Requirements Language">
      <t>The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
      "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
      document are to be interpreted as described in <xref
      target="RFC2119">RFC 2119</xref>.</t>
    </note>
  </front>

  <middle>
    <section title="Introduction">
      <t>As the rapid development of digital technologies, transporting a bulk
      of data conveniently between two faraway places has become a fundamental
      requirement for the Internet. However, the user experience of
      transferring a large file is not always that satisfying. For example,
      although we can connect to a cloud service anywhere now, sometimes the
      file upload experience is slow and troublesome. Alternatively, we can
      post a hard disk to others for this big data transferring.</t>

      <t>The Online Data Express Service (ODES) in this document is defined as
      a convenient online service, which can transfer a large file or a bundle
      of files between different entities within a time limitation. The file
      size may be at the GB, TB level, or even bigger. The time limitation may
      be several hours, one day or two days.</t>

      <t>The target approach in ODES should be more flexible and convenient
      than the manual hard disk express, and can be quicker than the approach
      of transferring directly by using the Internet and the default TCP
      transport protocol.</t>

      <t>The ODES use cases include inter-cloud backup and disaster recovery,
      film and television editing, scientific computing, cooperation of
      intelligent computing centers, etc. This document also analyzes the
      problems that need to be considered for ODES. A related gap analysis
      document can be found in <xref
      target="I-D.zhao-tsvwg-odes-gap-analysis"/>.</t>

      <t/>
    </section>

    <section anchor="UseCases" title="Use Cases of ODES ">
      <section anchor="UseCase1"
               title="Inter-Cloud Backup and Disaster Recovery">
        <t>As the development of the cloud computing industry, cloud data
        centers are bearing a large amount of various enterprise IT services.
        The storage, transmission, and protection of the massive growth data
        bring new challenges. For example, disaster recovery of core
        application data is necessary to ensure the enterprise data security
        and the business continuity. In the scenario of disaster recovery of
        the operator's traffic data, the daily data backup volume of a single
        IT cloud resource pool is at the TB level. The primary and backup data
        centers are normally built in different locations with long data
        transmission distances. However, they do not have high requirements
        for data transmission timeliness. By utilizing the tidal effect of the
        network, we can use the idle bandwidth at night for the transmission,
        so as to improve the data transmission efficiency and reduce the data
        transmission cost.</t>
      </section>

      <section anchor="UseCase2" title="Film and Television Editing">
        <t>The shooting materials for films and television variety shows need
        to be edited and rendered by a post-production company. Due to the
        uncertain shooting location, the shooting materials need to be
        transferred in bulk to the post-production company's site according to
        the shooting and production cycle. The raw material data volume of a
        large-scale variety show or film is at the PB level, with a single
        transmission of approximately 10TB to 100TB of data. The manual hard
        disk express method involves two times of data copies (the upload at
        the source, and the download at the destination), and manual handling
        (taking an airplane or a high-speed rail to transport the disk array).
        Each trip takes 2-3 days and requires dedicated personnel to operate,
        resulting in poor timeliness and low efficiency. How to fully utilize
        network capabilities and provide convenient online data movement
        services for the audio and video industry through online transmission
        while meeting timeliness and reducing labor investment costs poses new
        challenges to the network.</t>
      </section>

      <section anchor="UseCase3" title="Scientific Computing">
        <t>In the development of intelligent computing and supercomputing, the
        import and export of big data from intelligent computing and
        supercomputing centers is required. However, it lacks efficient and
        low-cost solutions, especially in supporting scientific computing
        scenarios such as astronomy and meteorology. Taking the calculation of
        FAST astronomical data of China as an example, FAST has approximate
        200 observation projects per year, with a single project generating
        observation data of TB~PB magnitude and an annual output of
        approximately 15PB. If the data export is done manually, the data
        export application may be delayed for several months due to the lack
        of dedicated personnel responsible for data copy operations. In
        addition, data transmission and destination data import operations are
        very time-consuming, significantly affecting the timeliness of data
        acquisition. In conclusion, there is an urgent need for an efficient
        and economical online data transmission solution for large-scale data
        migration scenarios in scientific computing.</t>
      </section>

      <section anchor="UseCase4" title="Gene Sequencing">
        <t>Gene sequencing technology is becoming increasingly mature,
        significantly shortening the sequencing time and promoting its
        application comprehensively. It can provide various gene sequencing
        and data analysis services to scientific research institutions,
        medical service institutions, or individuals. Traditional gene
        sequencing mainly relies on local laboratory analyses, and its
        timeliness and scale are constrained by local computing resources,
        making it difficult to improve. Cloud-based gene sequencing data have
        gradually become an industry trend. A domestic gene company would have
        a gene sequencing data volume of 100PB/year, with a cloud data volume
        of approximately TB~100TB/time. Nowadays, the gene sequencing data
        source and supercomputing cloud data center are connected through a
        fixed bandwidth dedicated line, which is expensive and lacks of
        cost-effective solutions.</t>
      </section>
    </section>

    <section anchor="ProblemStatement" title="Problem Statement of ODES ">
      <t>With the vigorous development of industrial digitalization and cloud
      computing, the demand for high-capacity data transmission in different
      places is increasing. At the same time, scenarios such as multi-cloud
      data backup and data on remote cloud in different places put forward
      higher requirements on the throughput of online data transmission. There
      is an urgent need to achieve high-throughput transmission of massive
      data in WAN.</t>

      <t>By analyzing and summarizing the above typical application scenarios,
      we obtain the following common features.</t>

      <t><list style="numbers">
          <t>Large amount of data transmitted in a single time: TB~PB.</t>

          <t>High flow transmission frequency: There is a demand for regular
          or irregular data transmission, with high peak bandwidth
          requirements.</t>

          <t>Low real-time requirements:&nbsp;Mainly warm or cold data, not
          strongly real-time hot data, but the faster the transmission
          completion time, the better.</t>

          <t>Cost-sensitive:&nbsp;Customers do not want to pay for high
          bandwidth dedicated lines separately because the transmission
          frequency is variable, which leads to low network utilization and
          cost-effectiveness.</t>
        </list>To support differentiated data delivery services and create
      task-based data delivery services, the target ODES system should meet
      the following requirements.</t>

      <t><list style="numbers">
          <t>High throughput in WAN is an important goal of data express
          service. Technologies such as wide-area RDMA and Elephant flow load
          balancing could be utilized to achieve high throughput network data
          transmission.</t>

          <t>Compared with using fixed bandwidth of traditional dedicated
          lines for data transmission, data express supports the bandwidth
          elastic expansion and contraction function, providing users with
          plug and play, stop after use and highly elastic services.</t>

          <t>Data, as the core asset of users, enterprises, social
          organizations, etc., are related to the interests of all parties, so
          we must attach great importance to data security. On the basis of
          providing efficient transmission services, data express strictly
          guarantees the security of user data and ensures that user data will
          not be stolen or tampered with.</t>

          <t>The data express business has the characteristics of large amount
          of data and long transmission distance. In order to better meet the
          business requirements, it is needed to ensure that the data can be
          transmitted to the destination in a time as short as possible if
          required, so that the flow completion time can be shorten.</t>
        </list></t>

      <t/>
    </section>

    <section anchor="IANA" title="IANA Considerations">
      <t>TBD.</t>
    </section>

    <section anchor="Security" title="Security Considerations">
      <t>TBD.</t>
    </section>

    <section anchor="Acknowledgements" title="Acknowledgements">
      <t>TBD.</t>
    </section>

    <section title="Contributors">
      <t>The following people have substantially contributed to this
      document:</t>

      <t><figure>
          <artwork><![CDATA[	Guangyu Zhao
	China Mobile
	zhaoguangyu@chinamobile.com


]]></artwork>
        </figure></t>
    </section>
  </middle>

  <back>
    <references title="Normative References">
      <?rfc include="reference.RFC.2119"?>
    </references>

    <references title="Informative References">
      <?rfc include="reference.I-D.zhao-tsvwg-odes-gap-analysis"?>
    </references>
  </back>
</rfc>
