<html>
    <head>
        <title>DAVIS Visualization - TAP-Net</title>
        <link rel="StyleSheet" href="style.css" type="text/css" media="all" /> 
        <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    </head>

    <body>
        <div id="primarycontent">
            <a href="index.html">back</a>
            <h1>TAP-Net: Tracking Any Point in a Video</h1><hr>
            <h2>DAVIS Point Tracking</h2>
            <p style="width:auto">
              TAP-Net generalizes to real-world videos from the DAVIS benchmark,
              even though its training labels came exclusively from the synthetic Kubric dataset.
              In each example, every tracked point is shown in a different color.
              For simplicity, all query points are given on the first frame, although our network can track queries from any frame.
              The points
              are typically tracked consistently over hundreds of frames, despite challenging
              occlusions and changes in the appearance and pose of the objects.  However, small objects
              and changes in scale remain challenging.
            </p>

            <table width="auto">
                <tr><td>
                    <video controls autoplay>
                        <source src="point_tracks_prediction/goat.mp4" type="video/mp4">
                    </video>
                      <p style="width:800px">The goat's body is tracked quite well, even though its texture is somewhat similar to the background.</p>
                </td></tr>
                <tr><td>
                    <video controls autoplay>
                    <source src="point_tracks_prediction/blackswan.mp4" type="video/mp4">
                    </video>
                    <p style="width:100%">In this example, the swan's body and face are tracked very well.  However, the bill is too thin, and our algorithm loses track of one point.</p>
                </td></tr>
                <tr><td>
                    <video controls autoplay>
                        <source src="point_tracks_prediction/camel.mp4" type="video/mp4">
                    </video>
                      <p style="width:800px">Some points on these camels are tracked quite well, including one on the hump of a heavily occluded camel.  However, large changes in viewpoint and thin structures cause some failures toward the end of the video.</p>
                </td></tr>
                <tr><td>
                    <video controls autoplay>
                        <source src="point_tracks_prediction/india.mp4" type="video/mp4">
                      </video>
                      <p style="width:800px">In this example, the network's tracks start off very precise but begin to deteriorate after large changes in scale caused by the zooming camera.</p>
                </td></tr>
            </table>
        </div>
    </body>

</html>
