
Conversation


@csmangum csmangum commented Apr 2, 2025


Related to #6

Optimize memory efficiency in adaptive models by implementing conditional computation, parameter sharing, and low-rank approximations.

* **Conditional Computation Architecture**:
  - Modify `AdaptiveEntropyBottleneck` in `meaning_transform/src/models/adaptive_entropy_bottleneck.py` to create projection layers only if compression exceeds a threshold.
  - Add low-rank approximations for large projections in `AdaptiveEntropyBottleneck`.

* **Parameter Sharing in FeatureGroupedVAE**:
  - Implement parameter sharing across feature groups in `FeatureGroupedVAE` in `meaning_transform/src/models/feature_grouped_vae.py`.
  - Update `FeatureGroupedVAE` to use shared components for each feature group.

* **Documentation Update**:
  - Update `docs/agent_memory_architecture.md` to reflect the new architecture with conditional computation, parameter sharing, and low-rank approximations.
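The conditional-computation and low-rank ideas above can be sketched as follows. This is an illustrative sketch, not the repository's actual class: the class name `ConditionalBottleneckSketch` and the `threshold` and `rank` parameters are assumptions for demonstration.

```python
import torch
import torch.nn as nn

class ConditionalBottleneckSketch(nn.Module):
    """Sketch: projection layers are created only when the compression level
    exceeds a threshold, and large projections use a low-rank factorization."""

    def __init__(self, latent_dim: int, compression_level: float,
                 threshold: float = 1.0, rank: int = 8):
        super().__init__()
        self.latent_dim = latent_dim
        # Effective dimension shrinks as the compression level grows.
        self.effective_dim = max(1, int(latent_dim / compression_level))

        if compression_level > threshold:
            # Low-rank pair stores rank * (latent_dim + effective_dim) weights
            # instead of latent_dim * effective_dim for a dense projection.
            self.proj_down = nn.Sequential(
                nn.Linear(latent_dim, rank, bias=False),
                nn.Linear(rank, self.effective_dim),
            )
        else:
            # Below the threshold, skip the projection entirely.
            self.proj_down = nn.Identity()
            self.effective_dim = latent_dim

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        return self.proj_down(z)
```

When compression is at or below the threshold, no projection parameters are allocated at all, which is where the memory savings come from.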

---

For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/Dooders/AgentMeaning/issues/6?shareId=XXXX-XXXX-XXXX-XXXX).
@csmangum csmangum requested a review from Copilot April 2, 2025 03:41

Copilot AI left a comment


Pull Request Overview

This PR optimizes memory efficiency for adaptive models by introducing conditional computation, low-rank approximations, and parameter sharing.

- Modified `AdaptiveEntropyBottleneck` to conditionally create projection layers based on a compression threshold and to use low-rank approximations for large projections.
- Updated `FeatureGroupedVAE` to share a common compressor across feature groups, replacing group-specific bottlenecks with shared components.
- Revised documentation in `docs/agent_memory_architecture.md` to reflect the new architectural changes.
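Sharing one compressor across feature groups can be sketched as below. The class name `GroupedVAESketch` and its structure are illustrative assumptions, not the repository's actual `FeatureGroupedVAE`.

```python
import torch
import torch.nn as nn

class GroupedVAESketch(nn.Module):
    """Sketch of parameter sharing: one compressor serves every feature group
    instead of a separate bottleneck per group."""

    def __init__(self, group_dims: dict, latent_dim: int):
        super().__init__()
        # Per-group adapters map each group to a common width...
        self.adapters = nn.ModuleDict({
            name: nn.Linear(dim, latent_dim)
            for name, dim in group_dims.items()
        })
        # ...while the compressor weights are shared across all groups.
        self.shared_compressor = nn.Linear(latent_dim, latent_dim)

    def forward(self, features: dict) -> dict:
        # Every group passes through the same shared compressor instance.
        return {
            name: self.shared_compressor(self.adapters[name](x))
            for name, x in features.items()
        }
```

Only the small per-group adapters scale with the number of groups; the compressor's parameter count stays constant.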

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

| File | Description |
| --- | --- |
| `meaning_transform/src/models/feature_grouped_vae.py` | Implemented shared compressor and updated loss and rate computations for groups. |
| `meaning_transform/src/models/adaptive_entropy_bottleneck.py` | Added conditional logic for projection layers and integrated low-rank approximations. |
| `docs/agent_memory_architecture.md` | Updated documentation to include details on conditional computation and sharing. |
Comments suppressed due to low confidence (3)

`meaning_transform/src/models/feature_grouped_vae.py:74`

- Consider defining a dedicated `nn.Module` subclass for the shared compressor to encapsulate the mu and scale networks; this improves clarity and maintainability.

```python
self.shared_compressor = nn.Module()
```
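The reviewer's suggestion could look something like the following. This is a hypothetical sketch: the class name `SharedCompressor`, the `hidden` width, and the network shapes are assumptions, not code from the PR.

```python
import torch
import torch.nn as nn

class SharedCompressor(nn.Module):
    """Hypothetical replacement for `self.shared_compressor = nn.Module()`:
    the mu and scale networks live inside one cohesive submodule."""

    def __init__(self, dim: int, hidden: int = 64):
        super().__init__()
        self.mu_net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))
        self.scale_net = nn.Sequential(
            nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))

    def forward(self, z: torch.Tensor):
        # Returns per-dimension mean and scale estimates for z.
        return self.mu_net(z), self.scale_net(z)
```

A proper subclass also ensures the mu/scale parameters are registered and show up in `state_dict()`, which attaching layers to a bare `nn.Module()` instance does achieve but less legibly.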

`meaning_transform/src/models/feature_grouped_vae.py:246`

- Review the compression loss computation to ensure it scales appropriately for each feature group and remains numerically stable; consider extracting the constant into a predefined variable.

```python
compression_loss += 0.5 * log_scale_group.mul(2).exp() + 0.5 * torch.log(2 * torch.tensor(torch.pi, device=z.device))
```
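Hoisting the constant might look like this. The function name `gaussian_rate_term` is hypothetical; it computes the same quantity as the inline expression above, with `log(2π)` precomputed once instead of rebuilt as a tensor on every step.

```python
import math
import torch

# Hoisted once at module level instead of rebuilt per training step.
LOG_2PI = math.log(2 * math.pi)

def gaussian_rate_term(log_scale: torch.Tensor) -> torch.Tensor:
    """Same quantity as the inline expression:
    0.5 * exp(2 * log_scale) + 0.5 * log(2 * pi)."""
    return 0.5 * log_scale.mul(2).exp() + 0.5 * LOG_2PI
```

Since `LOG_2PI` is a Python float, broadcasting handles the addition on any device without constructing a fresh tensor.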

`meaning_transform/src/models/adaptive_entropy_bottleneck.py:57`

- Ensure that `latent_dim` is large enough that `latent_dim // 4` is non-zero; otherwise the projection layers may not function as intended.

```python
self.proj_up = nn.Sequential(
    nn.Linear(self.effective_dim, latent_dim // 4),
    nn.LeakyReLU(),
    nn.Linear(latent_dim // 4, latent_dim * 2)
)
```
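One way to guard against this is clamping the hidden width, sketched below. The helper name `make_proj_up` is an illustration, not code from the PR.

```python
import torch
import torch.nn as nn

def make_proj_up(effective_dim: int, latent_dim: int) -> nn.Sequential:
    # Guard: for latent_dim < 4, latent_dim // 4 would be 0, collapsing the
    # hidden layer to zero width; clamp it to at least 1.
    hidden = max(1, latent_dim // 4)
    return nn.Sequential(
        nn.Linear(effective_dim, hidden),
        nn.LeakyReLU(),
        nn.Linear(hidden, latent_dim * 2),
    )
```

With the clamp, even degenerate configurations like `latent_dim=2` produce a usable (if narrow) projection rather than a zero-width bottleneck.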

