Jupyterlab extension to browse HDFS filesystem
npm install @swan-cern/hdfsbrowserHadoop JupyterLab Extension
This extension is composed of a Python package named hdfsbrowser, which installs the server+nbextension and a NPM package named @swan-cern/hdfsbrowser
for the JupyterLab extension.
* JupyterLab >= 2.1
Note: You will need NodeJS to install the extension.
``bash`
pip install hdfsbrowser
jupyter nbextension install hdfsbrowser --py
jupyter nbextension enable hdfsbrowser --py
jupyter lab build
Configure notebook jupyter_notebook_config.py:
``
c.HDFSBrowserConfig.hdfs_site_path = "/cvmfs/sft.cern.ch/lcg/etc/hadoop-confext/conf/etc/analytix/hadoop.analytix/hdfs-site.xml"
c.HDFSBrowserConfig.hdfs_site_namenodes_property = "dfs.ha.namenodes.analytix"
c.HDFSBrowserConfig.hdfs_site_namenodes_port = "50070"
c.HDFSBrowserConfig.webhdfs_token = "dummy"
If you are not seeing the frontend, check if it's installed:
`bash`
jupyter labextension list
If it is installed, try:
`bash`
jupyter lab clean
jupyter lab build
The jlpm command is JupyterLab's pinned version ofyarn
yarn that is installed with JupyterLab. You may use or npm in lieu of jlpm below.
`bashClone the repo to your local environment
Move to hdfsbrowser directory
You can watch the source directory and run JupyterLab in watch mode to watch for changes in the extension's source and automatically rebuild the extension and application.
`bash
Watch the source directory in another terminal tab
jlpm watch
Run jupyterlab in watch mode in one terminal tab
jupyter lab --watch
`$3
`bash
pip uninstall hdfsbrowser
jupyter labextension uninstall @swan-cern/hdfsbrowser
``