Unify input and local declarations in data model #799

eemeli · 2024-05-22T12:51:12Z

Closes #786

This change is a part of what was discussed in #718, i.e. the part that didn't get any critique in the issue.

In the syntax, keeping .input and .local as separate operations makes really good sense from a readability point of view. In the data model, as those concerns are not present, there is no reason to keep the separate type: 'input' and type: 'local' blocks. Both will almost certainly be processed the same way, and the input/local-ness can be inferred when necessary by

type = value.arg?.type == 'variable' && value.arg.name == name ? 'input' : 'local'

This also drops the need for a VariableExpression, which simplifies the TS & JSON Schema definitions quite a bit.

catamorphism

I think I've commented on this before, but the implication of this change is that either:

the parser has to check for all duplicate declaration errors
duplicate declaration error checking has to be split, with a special case in the parser for .local $x = {$x :func}, and then the more general check in a subsequent error-checking pass.

IMO it's useful for .local $x = {$x :func} to be representable in the data model, so that all duplicate declarations can be checked in a single pass that's separate from the parser.

catamorphism · 2024-05-27T09:30:14Z

spec/data-model/README.md

@@ -78,13 +78,13 @@ type Message = PatternMessage | SelectMessage;

 interface PatternMessage {
  type: "message";
-  declarations: Declaration[];
+  declarations: (Declaration | UnsupportedStatement)[];


Could the declarations field be renamed to something else? I'm not sure what, but it seems confusing to me that a list called "declarations" can include something that's not a declaration.

echeran

This PR does seem to introduce an unnecessary complexity (and thus burden) on implementations, as @catamorphism. As I understand it, this PR would require that an implementation's parser would have to complect together the separate concerns of parsing and duplicate declaration.

I like the attempt at simplifying the spec, but it currently doesn't seem to me to an improvement b/c it undoes an important distinction. Although, this discussion is useful and perhaps can be converted into important documentation explaining why these constructs have to remain separate. WDYT?

FWIW, for the record, even though the PR description says that it represents the uncritiqued parts of the discussion in #718, I see concerns on this topic in that discussion that didn't get resolved.

stasm · 2024-07-08T16:18:36Z

spec/data-model/README.md

+If the `value` has a `VariableRef` `arg` with the same `name`
+as the `Declaration`,
+it represents an _input-declaration_.
+Otherwise, it represents a _local-declaration_.


I'm not fond of the blanket value: Expression, which requires this wording here. I'd much prefer if the data model didn't need these additional validity constraints. I think the current data model achieves this to a larger extent.

I wouldn't want anyone to think that it's OK to have .input {:func} or .input {|1|} -- which I think the proposed change does suggest.

This change is in fact removing a MUST validity constraint on InputDeclaration, in addition to removing a similar implicit constraint on the LocalDeclaration that it must not contain in its value a VariableExpression which uses the same name as used at the top level of the declaration.

Could you explain how this proposed language suggests that something like .input {:func} is ok? As it explicitly requires the expression to have a VariableRef argument with a matching name, isn't that the same set of requirements that are expressed in the syntax?

stasm · 2024-07-08T16:19:53Z

spec/data-model/README.md

-  annotation?: FunctionAnnotation | UnsupportedAnnotation;
-  attributes: Attributes;
-}
+type Expression = OperandExpression | AnnotationExpression;


This is an interesting suggestion worth pursuing in a separate PR.

mihnita · 2024-07-22T17:28:13Z

I don't think these should be unified, they look the same, but are different concepts.

For example a translator can define as many local variables as they want.
But they MUST NOT define inputs, as inputs correspond to something in the calling code.

Similar to programming languages:

function foo(String arg1, int arg2 = 42) { // input
   int bar = 7 // local
   int baz = arg2 // local
   String locStr = arg1 // local

TLDR: not unify, they look superficially the same, but are different concepts.

eemeli · 2024-07-29T06:25:02Z

Responding to @echeran:

I like the attempt at simplifying the spec, but it currently doesn't seem to me to an improvement b/c it undoes an important distinction.

With this change, the .input and .local declarations are still distinct and distinguishable from each other in the data model, as the sets of valid "input" and "local" object contents are exclusive of each other. This means that we're removing a current duplication of data.

Responding to @mihnita:

I don't think these should be unified, they look the same, but are different concepts.

For example a translator can define as many local variables as they want. But they MUST NOT define inputs, as inputs correspond to something in the calling code.

As discussed during the 2024-07-15 call, there are situations in which it may be valid for a translator (or their tooling) to add a .input declaration where one did not exist previously.

I can provide a real-world example with this Firefox message, which in MF2 is effectively

{$num :integer} (default)

in the source locale (en-US), but its Japanese translation depends on the OS, requiring the use of a custom :platform selector:

.input {$num :integer}
.match {:platform}
macos {{{$num} (デフォルト)}}
* {{{$num} (既定)}}

Note how the introduction of the .input declaration has not changed the requirements or restrictions on the num value, as in both the source and target locales it's formatted the same way using :integer.

MF2 allows for this, while also allowing for a specific user to choose to restrict changes such as the above. It's also good to note that similar restrictions on the input may also be imposed by a .local declaration:

.local $foo = {$num :integer}
.match {:platform}
macos {{{$foo} (デフォルト)}}
* {{{$foo} (既定)}}

As in the example above using .input, this message requires the num value to be usable as an integer.

In other words, the more appropriate "MUST NOT" to impose on translators (e.g. via tooling) is changes to expressions where the operand is externally provided. Say, if the original was instead

{$num} (default)

then it would not be valid to change the expression to {$num :number} in any locale based on a presumption about the value's type from its name.

That is a restriction which applies to all .input declarations, but it also applies to many .local declarations and placeholders as well; we really should not have .input be special in the data model, as it could mislead other developers the same way about what sorts of changes and transforms could be safe, and which are not.

aphillips · 2024-09-10T16:58:22Z

Discussed in 2024-09-10 call. Moving to post-46 but pre-2.0

Unify input and local declarations in data model

ac4d83a

eemeli added the data model Issues related with MF data Model label May 22, 2024

catamorphism reviewed May 27, 2024

View reviewed changes

Merge branch 'main' into simpler-data

44f3767

aphillips approved these changes Jun 9, 2024

View reviewed changes

echeran requested changes Jun 12, 2024

View reviewed changes

stasm requested changes Jul 8, 2024

View reviewed changes

aphillips mentioned this pull request Jul 9, 2024

[FEEDBACK] What is the semantic difference between local declaration using itself, and an input declaration #819

Open

eemeli linked an issue Jul 29, 2024 that may be closed by this pull request

[FEEDBACK] What is the semantic difference between local declaration using itself, and an input declaration #819

Open

aphillips added blocker-candidate The submitter thinks this might be a block for the Technology Preview Future Deferred for future standardization labels Sep 10, 2024

samuelstroschein mentioned this pull request Sep 13, 2024

Model import -> CRUD (sdk) -> export opral/inlang-sdk#195

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify input and local declarations in data model #799

Unify input and local declarations in data model #799

eemeli commented May 22, 2024

catamorphism left a comment

catamorphism May 27, 2024

echeran left a comment

stasm Jul 8, 2024

eemeli Jul 29, 2024

stasm Jul 8, 2024

mihnita commented Jul 22, 2024 •

edited

Loading

eemeli commented Jul 29, 2024 •

edited

Loading

aphillips commented Sep 10, 2024

Unify input and local declarations in data model #799

Are you sure you want to change the base?

Unify input and local declarations in data model #799

Conversation

eemeli commented May 22, 2024

catamorphism left a comment

Choose a reason for hiding this comment

catamorphism May 27, 2024

Choose a reason for hiding this comment

echeran left a comment

Choose a reason for hiding this comment

stasm Jul 8, 2024

Choose a reason for hiding this comment

eemeli Jul 29, 2024

Choose a reason for hiding this comment

stasm Jul 8, 2024

Choose a reason for hiding this comment

mihnita commented Jul 22, 2024 • edited Loading

eemeli commented Jul 29, 2024 • edited Loading

aphillips commented Sep 10, 2024

mihnita commented Jul 22, 2024 •

edited

Loading

eemeli commented Jul 29, 2024 •

edited

Loading