Failed to import the NeMo framework or its dependencies! 'pip install -U \"batchalign[speaker]

### Issue:

In https://talkbank.org/info/BA2-usage.pdf:

> To use Batchalign, you first have to install a version of Python greater than 3.9. You can
> install the current version from https://python.org/downloads. Some external
> dependencies will be installed automatically when you install Batchalign. This means
> that, you should not install any dependencies (like Whisper or Nvidia Nemo) manually. If
> a special function requires a manual dependency, the Batchalign program will prompt
> you with instructions

## Attempting to get a transcript with a working diarize to run on my local machine.
`batchalign -vvv transcribe --lang=eng --whisper --num_speakers=2 --diarize .\.tmp_test\in .\.tmp_test\out\`
<details><summary>Expand for stacktrace</summary><pre>
[03/02/25 10:45:02] DEBUG    Attempting to create               dispatch.py:134
                             BatchalignPipeline for CLI...                     
                    DEBUG    Initializing packages, got:         dispatch.py:64
                             packages='['asr', 'speaker']' and                 
                             config='{'DEFAULT': <Section:                     
                             DEFAULT>, 'asr': <Section: asr>,                  
                             'ud': <Section: ud>}'                             
                    INFO     Initializing engines...             dispatch.py:85
                    INFO     -------------------------------     dispatch.py:86
                    INFO     | asr          |      whisper |    dispatch.py:100
                    INFO     -------------------------------    dispatch.py:101
                    DEBUG    Initializing whisper model...      infer_asr.py:67
Device set to use cpu
[03/02/25 10:45:03] DEBUG    Done, initalizing processor and    infer_asr.py:81
                             config...                                         
                    DEBUG    Whisper initialization done.       infer_asr.py:83
                    DEBUG    Initializing utterance model...      whisper.py:46
                    DEBUG    Done.                                whisper.py:52
                    INFO     | speaker      | nemo_speaker |    dispatch.py:100
                    INFO     -------------------------------    dispatch.py:101
+--------------------- Traceback (most recent call last) ---------------------+
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\models\speaker\infer.py:27 in __init__                         |
|                                                                             |
|   24 class NemoSpeakerModel(object):                                        |
|   25     def __init__(self):                                                |
|   26         try:                                                           |
| > 27             from omegaconf import OmegaConf                            |
|   28             self.__base = OmegaConf.load(resolve_config())             |
|   29         except ImportError:                                            |
|   30             self.__raise()                                             |
+-----------------------------------------------------------------------------+
ModuleNotFoundError: No module named 'omegaconf'
System.Management.Automation.RemoteException
During handling of the above exception, another exception occurred:
System.Management.Automation.RemoteException
+--------------------- Traceback (most recent call last) ---------------------+
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\cli\dispatch.py:135 in _dispatch                               |
|                                                                             |
|   132                                                                       |
|   133         # create pipeline and read files                              |
|   134         baL.debug("Attempting to create BatchalignPipeline for CLI... |
| > 135         pipeline = BatchalignPipeline.new(Cmd2Task[command],          |
|   136                                           lang=lang, num_speakers=num |
|   137         baL.debug(f"Successfully created BatchalignPipeline... {pipel |
|   138                                                                       |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\pipelines\pipeline.py:60 in new                                |
|                                                                             |
|    57         """                                                           |
|    58                                                                       |
|    59         from batchalign.pipelines.dispatch import dispatch_pipeline   |
| >  60         return dispatch_pipeline(tasks, lang=lang, num_speakers=num_s |
|    61                                                                       |
|    62     def __call__(self, input, callback=None, **kwargs):               |
|    63         """Call the pipeline.                                         |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\pipelines\dispatch.py:126 in dispatch_pipeline                 |
|                                                                             |
|   123         elif engine == "evaluation":                                  |
|   124             engines.append(EvaluationEngine())                        |
|   125         elif engine == "nemo_speaker":                                |
| > 126             engines.append(NemoSpeakerEngine(num_speakers=num_speaker |
|   127         elif engine == "stanza_utt":                                  |
|   128             engines.append(StanzaUtteranceEngine())                   |
|   129         elif engine == "stanza_coref":                                |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\pipelines\speaker\nemo_speaker.py:26 in __init__               |
|                                                                             |
|   23                                                                        |
|   24         self.status_hook = None                                        |
|   25         self.num_speakers = num_speakers                               |
| > 26         self.__model = NemoSpeakerModel()                              |
|   27                                                                        |
|   28     def process(self, doc:Document, **kwargs):                         |
|   29         # check that the document has a media path to align to         |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\models\speaker\infer.py:30 in __init__                         |
|                                                                             |
|   27             from omegaconf import OmegaConf                            |
|   28             self.__base = OmegaConf.load(resolve_config())             |
|   29         except ImportError:                                            |
| > 30             self.__raise()                                             |
|   31                                                                        |
|   32     def __raise(self):                                                 |
|   33         raise ImportError("Failed to import the NeMo framework or its  |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\models\speaker\infer.py:33 in __raise                          |
|                                                                             |
|   30             self.__raise()                                             |
|   31                                                                        |
|   32     def __raise(self):                                                 |
| > 33         raise ImportError("Failed to import the NeMo framework or its  |
|   34                                                                        |
|   35     def __call__(self, in_file, num_speakers=2):                       |
|   36         try:                                                           |
+-----------------------------------------------------------------------------+
ImportError: Failed to import the NeMo framework or its dependencies!
Hint: run 'pip install -U "batchalign[speaker]"' to install speaker diarization
tools.
System.Management.Automation.RemoteException
During handling of the above exception, another exception occurred:
System.Management.Automation.RemoteException
+--------------------- Traceback (most recent call last) ---------------------+
| in _run_module_as_main:198                                                  |
| in _run_code:88                                                             |
|                                                                             |
| in <module>:7                                                               |
|                                                                             |
|   4 from batchalign.cli.cli import batchalign                               |
|   5 if __name__ == '__main__':                                              |
|   6     sys.argv[0] = re.sub(r'(-script\.pyw|\.exe)?$', '', sys.argv[0])    |
| > 7     sys.exit(batchalign())                                              |
|   8                                                                         |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich_click\rich_command.py:367 in __call__                                |
|                                                                             |
|   364         # Include this here because I run into a false warning        |
|   365         # in the PyCharm IDE otherwise; for some reason PyCharm doesn |
|   366         # seem to think RichGroups are callable. (No issues with Mypy |
| > 367         return super().__call__(*args, **kwargs)                      |
|   368                                                                       |
|   369                                                                       |
|   370 class RichCommandCollection(CommandCollection, RichGroup):            |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\click\core.py:1161 in __call__                                            |
|                                                                             |
|   1158                                                                      |
|   1159     def __call__(self, *args: t.Any, **kwargs: t.Any) -> t.Any:      |
|   1160         """Alias for :meth:`main`."""                                |
| > 1161         return self.main(*args, **kwargs)                            |
|   1162                                                                      |
|   1163                                                                      |
|   1164 class Command(BaseCommand):                                          |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich_click\rich_command.py:152 in main                                    |
|                                                                             |
|   149         try:                                                          |
|   150             try:                                                      |
|   151                 with self.make_context(prog_name, args, **extra) as c |
| > 152                     rv = self.invoke(ctx)                             |
|   153                     if not standalone_mode:                           |
|   154                         return rv                                     |
|   155                     # it's not safe to `ctx.exit(rv)` here!           |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\click\core.py:1697 in invoke                                              |
|                                                                             |
|   1694                 super().invoke(ctx)                                  |
|   1695                 sub_ctx = cmd.make_context(cmd_name, args, parent=ct |
|   1696                 with sub_ctx:                                        |
| > 1697                     return _process_result(sub_ctx.command.invoke(su |
|   1698                                                                      |
|   1699         # In chain mode we create the contexts step by step, but aft |
|   1700         # base command has been invoked.  Because at that point we d |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\click\core.py:1443 in invoke                                              |
|                                                                             |
|   1440             echo(style(message, fg="red"), err=True)                 |
|   1441                                                                      |
|   1442         if self.callback is not None:                                |
| > 1443             return ctx.invoke(self.callback, **ctx.params)           |
|   1444                                                                      |
|   1445     def shell_complete(self, ctx: Context, incomplete: str) -> t.Lis |
|   1446         """Return a list of completions for the incomplete value. Lo |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\click\core.py:788 in invoke                                               |
|                                                                             |
|    785                                                                      |
|    786         with augment_usage_errors(__self):                           |
|    787             with ctx:                                                |
| >  788                 return __callback(*args, **kwargs)                   |
|    789                                                                      |
|    790     def forward(__self, __cmd: "Command", *args: t.Any, **kwargs: t. |
|    791         """Similar to :meth:`invoke` but fills in default keyword    |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\click\decorators.py:33 in new_func                                        |
|                                                                             |
|    30     """                                                               |
|    31                                                                       |
|    32     def new_func(*args: "P.args", **kwargs: "P.kwargs") -> "R":       |
| >  33         return f(get_current_context(), *args, **kwargs)              |
|    34                                                                       |
|    35     return update_wrapper(new_func, f)                                |
|    36                                                                       |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\cli\cli.py:187 in transcribe                                   |
|                                                                             |
|   184                                 write_wor=kwargs.get("wor", False))   |
|   185                                                                       |
|   186     if kwargs.get("diarize"):                                         |
| > 187         _dispatch("transcribe_s",                                     |
|   188                   lang, num_speakers, ["mp3", "mp4", "wav"], ctx,     |
|   189                   in_dir, out_dir,                                    |
|   190                   loader, writer, C,                                  |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\batchalign\cli\dispatch.py:126 in _dispatch                               |
|                                                                             |
|   123     # cache the errors                                                |
|   124     errors = []                                                       |
|   125                                                                       |
| > 126     with prog as prog:                                                |
|   127         tasks = {}                                                    |
|   128         errors = []                                                   |
|   129         # create the spinner bars                                     |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\progress.py:1191 in __exit__                                         |
|                                                                             |
|   1188         exc_val: Optional[BaseException],                            |
|   1189         exc_tb: Optional[TracebackType],                             |
|   1190     ) -> None:                                                       |
| > 1191         self.stop()                                                  |
|   1192                                                                      |
|   1193     def track(                                                       |
|   1194         self,                                                        |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\progress.py:1177 in stop                                             |
|                                                                             |
|   1174                                                                      |
|   1175     def stop(self) -> None:                                          |
|   1176         """Stop the progress display."""                             |
| > 1177         self.live.stop()                                             |
|   1178         if not self.console.is_interactive and not self.console.is_j |
|   1179             self.console.print()                                     |
|   1180                                                                      |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\live.py:147 in stop                                                  |
|                                                                             |
|   144                 self._refresh_thread = None                           |
|   145             # allow it to fully render on the last even if overflow   |
|   146             self.vertical_overflow = "visible"                        |
| > 147             with self.console:                                        |
|   148                 try:                                                  |
|   149                     if not self._alt_screen and not self.console.is_j |
|   150                         self.refresh()                                |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\console.py:864 in __exit__                                           |
|                                                                             |
|    861                                                                      |
|    862     def __exit__(self, exc_type: Any, exc_value: Any, traceback: Any |
|    863         """Exit buffer context."""                                   |
| >  864         self._exit_buffer()                                          |
|    865                                                                      |
|    866     def begin_capture(self) -> None:                                 |
|    867         """Begin capturing console output. Call :meth:`end_capture`  |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\console.py:822 in _exit_buffer                                       |
|                                                                             |
|    819     def _exit_buffer(self) -> None:                                  |
|    820         """Leave buffer context, and render content if required."""  |
|    821         self._buffer_index -= 1                                      |
| >  822         self._check_buffer()                                         |
|    823                                                                      |
|    824     def set_live(self, live: "Live") -> None:                        |
|    825         """Set Live instance. Used by Live context manager.          |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\console.py:2019 in _check_buffer                                     |
|                                                                             |
|   2016             return                                                   |
|   2017                                                                      |
|   2018         try:                                                         |
| > 2019             self._write_buffer()                                     |
|   2020         except BrokenPipeError:                                      |
|   2021             self.on_broken_pipe()                                    |
|   2022                                                                      |
|                                                                             |
| c:\Users\PC\Documents\python_scripts\videotranscript\.venv\Lib\site-package |
| s\rich\console.py:2067 in _write_buffer                                     |
|                                                                             |
|   2064                             MAX_WRITE = 32 * 1024 // 4               |
|   2065                             try:                                     |
|   2066                                 if len(text) <= MAX_WRITE:           |
| > 2067                                     write(text)                      |
|   2068                                 else:                                |
|   2069                                     batch: List[str] = []            |
|   2070                                     batch_append = batch.append      |
|                                                                             |
| C:\Users\PC\AppData\Local\Programs\Python\Python312\Lib\tempfile.py:499 in  |
| func_wrapper                                                                |
|                                                                             |
|   496             func = a                                                  |
|   497             @_functools.wraps(func)                                   |
|   498             def func_wrapper(*args, **kwargs):                        |
| > 499                 return func(*args, **kwargs)                          |
|   500             # Avoid closing the file as long as the wrapper is alive, |
|   501             # see issue #18879.                                       |
|   502             func_wrapper._closer = self._closer                       |
|                                                                             |
| C:\Users\PC\AppData\Local\Programs\Python\Python312\Lib\encodings\cp1252.py |
| :19 in encode                                                               |
|                                                                             |
|    16                                                                       |
|    17 class IncrementalEncoder(codecs.IncrementalEncoder):                  |
|    18     def encode(self, input, final=False):                             |
| >  19         return codecs.charmap_encode(input,self.errors,encoding_table |
|    20                                                                       |
|    21 class IncrementalDecoder(codecs.IncrementalDecoder):                  |
|    22     def decode(self, input, final=False):                             |
+-----------------------------------------------------------------------------+
UnicodeEncodeError: 'charmap' codec can't encode character '\u2827' in position
0: character maps to <undefined>
*** You may need to add PYTHONIOENCODING=utf-8 to your environment ***

</pre></details> 
I tried this with python 3.12.9, 3.11.11, and 3.9 (all in a fresh venv).
Whats the deal?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Failed to import the NeMo framework or its dependencies! 'pip install -U \"batchalign[speaker] #30

Issue:

Attempting to get a transcript with a working diarize to run on my local machine.

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Failed to import the NeMo framework or its dependencies! 'pip install -U \"batchalign[speaker] #30

Description

Issue:

Attempting to get a transcript with a working diarize to run on my local machine.

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions