LIMIT for UPDATE and DELETE

Started by Rukh Meskiover 11 years ago29 messages

rukh.meski@gmail.com

over 11 years ago

1 attachment(s)

Greetings,

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

♜

Attachments:

update_delete_limit_v1.difftext/plain; charset=US-ASCII; name=update_delete_limit_v1.diffDownload

*** a/doc/src/sgml/ref/delete.sgml
--- b/doc/src/sgml/ref/delete.sgml
***************
*** 25,30 **** PostgreSQL documentation
--- 25,31 ----
  DELETE FROM [ ONLY ] <replaceable class="PARAMETER">table_name</replaceable> [ * ] [ [ AS ] <replaceable class="parameter">alias</replaceable> ]
      [ USING <replaceable class="PARAMETER">using_list</replaceable> ]
      [ WHERE <replaceable class="PARAMETER">condition</replaceable> | WHERE CURRENT OF <replaceable class="PARAMETER">cursor_name</replaceable> ]
+     [ LIMIT { <replaceable class="parameter">count</replaceable> | ALL } ]
      [ RETURNING * | <replaceable class="parameter">output_expression</replaceable> [ [ AS ] <replaceable class="parameter">output_name</replaceable> ] [, ...] ]
  </synopsis>
   </refsynopsisdiv>
***************
*** 56,61 **** DELETE FROM [ ONLY ] <replaceable class="PARAMETER">table_name</replaceable> [ *
--- 57,70 ----
    </para>
  
    <para>
+    If the <literal>LIMIT</> (or <literal>FETCH FIRST</>) clause
+    is present, processing will stop after the system has deleted the
+    specified amount of rows.  Unlike in <literal>SELECT</>, the
+    <literal>OFFSET</literal> clause is not available in
+    <literal>DELETE</>.
+   </para>
+ 
+   <para>
     The optional <literal>RETURNING</> clause causes <command>DELETE</>
     to compute and return value(s) based on each row actually deleted.
     Any expression using the table's columns, and/or columns of other
*** a/doc/src/sgml/ref/update.sgml
--- b/doc/src/sgml/ref/update.sgml
***************
*** 29,34 **** UPDATE [ ONLY ] <replaceable class="PARAMETER">table_name</replaceable> [ * ] [
--- 29,35 ----
          } [, ...]
      [ FROM <replaceable class="PARAMETER">from_list</replaceable> ]
      [ WHERE <replaceable class="PARAMETER">condition</replaceable> | WHERE CURRENT OF <replaceable class="PARAMETER">cursor_name</replaceable> ]
+     [ LIMIT { <replaceable class="parameter">count</replaceable> | ALL } ]
      [ RETURNING * | <replaceable class="parameter">output_expression</replaceable> [ [ AS ] <replaceable class="parameter">output_name</replaceable> ] [, ...] ]
  </synopsis>
   </refsynopsisdiv>
***************
*** 51,56 **** UPDATE [ ONLY ] <replaceable class="PARAMETER">table_name</replaceable> [ * ] [
--- 52,66 ----
     circumstances.
    </para>
  
+ 
+   <para>
+    If the <literal>LIMIT</> (or <literal>FETCH FIRST</>) clause
+    is present, processing will stop after the system has updated
+    the specified amount of rows.  Unlike in <literal>SELECT</>, the
+    <literal>OFFSET</literal> clause is not available in
+    <literal>UPDATE</>.
+   </para>
+ 
    <para>
     The optional <literal>RETURNING</> clause causes <command>UPDATE</>
     to compute and return value(s) based on each row actually updated.
*** a/src/backend/executor/nodeModifyTable.c
--- b/src/backend/executor/nodeModifyTable.c
***************
*** 323,329 **** ExecDelete(ItemPointer tupleid,
  		   TupleTableSlot *planSlot,
  		   EPQState *epqstate,
  		   EState *estate,
! 		   bool canSetTag)
  {
  	ResultRelInfo *resultRelInfo;
  	Relation	resultRelationDesc;
--- 323,329 ----
  		   TupleTableSlot *planSlot,
  		   EPQState *epqstate,
  		   EState *estate,
! 		   int64_t *processed)
  {
  	ResultRelInfo *resultRelInfo;
  	Relation	resultRelationDesc;
***************
*** 480,487 **** ldelete:;
  		 */
  	}
  
! 	if (canSetTag)
! 		(estate->es_processed)++;
  
  	/* AFTER ROW DELETE Triggers */
  	ExecARDeleteTriggers(estate, resultRelInfo, tupleid, oldtuple);
--- 480,486 ----
  		 */
  	}
  
! 	(*processed)++;
  
  	/* AFTER ROW DELETE Triggers */
  	ExecARDeleteTriggers(estate, resultRelInfo, tupleid, oldtuple);
***************
*** 572,578 **** ExecUpdate(ItemPointer tupleid,
  		   TupleTableSlot *planSlot,
  		   EPQState *epqstate,
  		   EState *estate,
! 		   bool canSetTag)
  {
  	HeapTuple	tuple;
  	ResultRelInfo *resultRelInfo;
--- 571,577 ----
  		   TupleTableSlot *planSlot,
  		   EPQState *epqstate,
  		   EState *estate,
! 		   int64_t *processed)
  {
  	HeapTuple	tuple;
  	ResultRelInfo *resultRelInfo;
***************
*** 771,778 **** lreplace:;
  												   estate);
  	}
  
! 	if (canSetTag)
! 		(estate->es_processed)++;
  
  	/* AFTER ROW UPDATE Triggers */
  	ExecARUpdateTriggers(estate, resultRelInfo, tupleid, oldtuple, tuple,
--- 770,776 ----
  												   estate);
  	}
  
! 	(*processed)++;
  
  	/* AFTER ROW UPDATE Triggers */
  	ExecARUpdateTriggers(estate, resultRelInfo, tupleid, oldtuple, tuple,
***************
*** 885,896 **** ExecModifyTable(ModifyTableState *node)
  		return NULL;
  
  	/*
! 	 * On first call, fire BEFORE STATEMENT triggers before proceeding.
  	 */
! 	if (node->fireBSTriggers)
  	{
  		fireBSTriggers(node);
! 		node->fireBSTriggers = false;
  	}
  
  	/* Preload local variables */
--- 883,919 ----
  		return NULL;
  
  	/*
! 	 * On first call, evaluate the LIMIT expression if there is one, and fire
! 	 * BEFORE STATEMENT triggers before proceeding.
  	 */
! 	if (node->firstCall)
  	{
+ 		if (node->limitCount)
+ 		{
+ 			ExprContext *econtext = node->ps.ps_ExprContext;
+ 			Datum val;
+ 			bool isNull;
+ 
+ 			val = ExecEvalExprSwitchContext(node->limitCount,
+ 											econtext,
+ 											&isNull,
+ 											NULL);
+ 			if (isNull)
+ 				node->maxProcessed = -1;
+ 			else
+ 			{
+ 				node->maxProcessed = DatumGetInt64(val);
+ 				if (node->maxProcessed < 0)
+ 					ereport(ERROR,
+ 							(errcode(ERRCODE_INVALID_ROW_COUNT_IN_LIMIT_CLAUSE),
+ 							errmsg("LIMIT must not be negative")));
+ 			}
+ 		}
+ 		else
+ 			node->maxProcessed = -1;
+ 
  		fireBSTriggers(node);
! 		node->firstCall = false;
  	}
  
  	/* Preload local variables */
***************
*** 916,921 **** ExecModifyTable(ModifyTableState *node)
--- 939,955 ----
  	for (;;)
  	{
  		/*
+ 		 * Check that we haven't hit the LIMIT yet, if one was specified.  We
+ 		 * must do this inside the loop in case a RETURNING clause was not
+ 		 * present.
+ 		 */
+ 		if (node->processed == node->maxProcessed)
+ 		{
+ 			node->mt_done = true;
+ 			break;
+ 		}
+ 
+ 		/*
  		 * Reset the per-output-tuple exprcontext.  This is needed because
  		 * triggers expect to use that context as workspace.  It's a bit ugly
  		 * to do this below the top level of the plan, however.  We might need
***************
*** 1023,1036 **** ExecModifyTable(ModifyTableState *node)
  		{
  			case CMD_INSERT:
  				slot = ExecInsert(slot, planSlot, estate, node->canSetTag);
  				break;
  			case CMD_UPDATE:
  				slot = ExecUpdate(tupleid, oldtuple, slot, planSlot,
! 								&node->mt_epqstate, estate, node->canSetTag);
  				break;
  			case CMD_DELETE:
  				slot = ExecDelete(tupleid, oldtuple, planSlot,
! 								&node->mt_epqstate, estate, node->canSetTag);
  				break;
  			default:
  				elog(ERROR, "unknown operation");
--- 1057,1077 ----
  		{
  			case CMD_INSERT:
  				slot = ExecInsert(slot, planSlot, estate, node->canSetTag);
+ 				/* estate->es_processed already updated */
  				break;
  			case CMD_UPDATE:
  				slot = ExecUpdate(tupleid, oldtuple, slot, planSlot,
! 								&node->mt_epqstate, estate, &node->processed);
! 				/* keep EState up to date */
! 				if (node->canSetTag)
! 					estate->es_processed = node->processed;
  				break;
  			case CMD_DELETE:
  				slot = ExecDelete(tupleid, oldtuple, planSlot,
! 								&node->mt_epqstate, estate, &node->processed);
! 				/* keep EState up to date */
! 				if (node->canSetTag)
! 					estate->es_processed = node->processed;
  				break;
  			default:
  				elog(ERROR, "unknown operation");
***************
*** 1091,1096 **** ExecInitModifyTable(ModifyTable *node, EState *estate, int eflags)
--- 1132,1138 ----
  
  	mtstate->operation = operation;
  	mtstate->canSetTag = node->canSetTag;
+ 	mtstate->processed = 0;
  	mtstate->mt_done = false;
  
  	mtstate->mt_plans = (PlanState **) palloc0(sizeof(PlanState *) * nplans);
***************
*** 1100,1106 **** ExecInitModifyTable(ModifyTable *node, EState *estate, int eflags)
  
  	/* set up epqstate with dummy subplan data for the moment */
  	EvalPlanQualInit(&mtstate->mt_epqstate, estate, NULL, NIL, node->epqParam);
! 	mtstate->fireBSTriggers = true;
  
  	/*
  	 * call ExecInitNode on each of the plans to be executed and save the
--- 1142,1148 ----
  
  	/* set up epqstate with dummy subplan data for the moment */
  	EvalPlanQualInit(&mtstate->mt_epqstate, estate, NULL, NIL, node->epqParam);
! 	mtstate->firstCall = true;
  
  	/*
  	 * call ExecInitNode on each of the plans to be executed and save the
***************
*** 1381,1386 **** ExecInitModifyTable(ModifyTable *node, EState *estate, int eflags)
--- 1423,1441 ----
  		estate->es_trig_tuple_slot = ExecInitExtraTupleSlot(estate);
  
  	/*
+ 	 * If we had a LIMIT, initialize it.  We will evaluate it before the first
+ 	 * row is processed.
+ 	 */
+ 	if (node->limitCount)
+ 	{
+ 		/* also create a context if RETURNING didn't already do that */
+ 		if (!mtstate->ps.ps_ExprContext)
+ 			mtstate->ps.ps_ExprContext = CreateExprContext(estate);
+ 		mtstate->limitCount = ExecInitExpr((Expr *) node->limitCount,
+ 										   (PlanState *) mtstate);
+ 	}
+ 
+ 	/*
  	 * Lastly, if this is not the primary (canSetTag) ModifyTable node, add it
  	 * to estate->es_auxmodifytables so that it will be run to completion by
  	 * ExecPostprocessPlan.  (It'd actually work fine to add the primary
*** a/src/backend/nodes/copyfuncs.c
--- b/src/backend/nodes/copyfuncs.c
***************
*** 183,188 **** _copyModifyTable(const ModifyTable *from)
--- 183,189 ----
  	COPY_NODE_FIELD(fdwPrivLists);
  	COPY_NODE_FIELD(rowMarks);
  	COPY_SCALAR_FIELD(epqParam);
+ 	COPY_NODE_FIELD(limitCount);
  
  	return newnode;
  }
***************
*** 2532,2537 **** _copyDeleteStmt(const DeleteStmt *from)
--- 2533,2539 ----
  	COPY_NODE_FIELD(whereClause);
  	COPY_NODE_FIELD(returningList);
  	COPY_NODE_FIELD(withClause);
+ 	COPY_NODE_FIELD(limitClause);
  
  	return newnode;
  }
***************
*** 2547,2552 **** _copyUpdateStmt(const UpdateStmt *from)
--- 2549,2555 ----
  	COPY_NODE_FIELD(fromClause);
  	COPY_NODE_FIELD(returningList);
  	COPY_NODE_FIELD(withClause);
+ 	COPY_NODE_FIELD(limitClause);
  
  	return newnode;
  }
*** a/src/backend/nodes/equalfuncs.c
--- b/src/backend/nodes/equalfuncs.c
***************
*** 897,902 **** _equalDeleteStmt(const DeleteStmt *a, const DeleteStmt *b)
--- 897,903 ----
  	COMPARE_NODE_FIELD(whereClause);
  	COMPARE_NODE_FIELD(returningList);
  	COMPARE_NODE_FIELD(withClause);
+ 	COMPARE_NODE_FIELD(limitClause);
  
  	return true;
  }
***************
*** 910,915 **** _equalUpdateStmt(const UpdateStmt *a, const UpdateStmt *b)
--- 911,917 ----
  	COMPARE_NODE_FIELD(fromClause);
  	COMPARE_NODE_FIELD(returningList);
  	COMPARE_NODE_FIELD(withClause);
+ 	COMPARE_NODE_FIELD(limitClause);
  
  	return true;
  }
*** a/src/backend/nodes/nodeFuncs.c
--- b/src/backend/nodes/nodeFuncs.c
***************
*** 2988,2993 **** raw_expression_tree_walker(Node *node,
--- 2988,2995 ----
  					return true;
  				if (walker(stmt->withClause, context))
  					return true;
+ 				if (walker(stmt->limitClause, context))
+ 					return true;
  			}
  			break;
  		case T_UpdateStmt:
***************
*** 3006,3011 **** raw_expression_tree_walker(Node *node,
--- 3008,3015 ----
  					return true;
  				if (walker(stmt->withClause, context))
  					return true;
+ 				if (walker(stmt->limitClause, context))
+ 					return true;
  			}
  			break;
  		case T_SelectStmt:
*** a/src/backend/nodes/outfuncs.c
--- b/src/backend/nodes/outfuncs.c
***************
*** 337,342 **** _outModifyTable(StringInfo str, const ModifyTable *node)
--- 337,343 ----
  	WRITE_NODE_FIELD(fdwPrivLists);
  	WRITE_NODE_FIELD(rowMarks);
  	WRITE_INT_FIELD(epqParam);
+ 	WRITE_NODE_FIELD(limitCount);
  }
  
  static void
*** a/src/backend/optimizer/plan/createplan.c
--- b/src/backend/optimizer/plan/createplan.c
***************
*** 4719,4725 **** make_modifytable(PlannerInfo *root,
  				 CmdType operation, bool canSetTag,
  				 List *resultRelations, List *subplans,
  				 List *withCheckOptionLists, List *returningLists,
! 				 List *rowMarks, int epqParam)
  {
  	ModifyTable *node = makeNode(ModifyTable);
  	Plan	   *plan = &node->plan;
--- 4719,4725 ----
  				 CmdType operation, bool canSetTag,
  				 List *resultRelations, List *subplans,
  				 List *withCheckOptionLists, List *returningLists,
! 				 List *rowMarks, int epqParam, Node *limitCount)
  {
  	ModifyTable *node = makeNode(ModifyTable);
  	Plan	   *plan = &node->plan;
***************
*** 4772,4777 **** make_modifytable(PlannerInfo *root,
--- 4772,4778 ----
  	node->returningLists = returningLists;
  	node->rowMarks = rowMarks;
  	node->epqParam = epqParam;
+ 	node->limitCount = limitCount;
  
  	/*
  	 * For each result relation that is a foreign table, allow the FDW to
*** a/src/backend/optimizer/plan/planner.c
--- b/src/backend/optimizer/plan/planner.c
***************
*** 610,616 **** subquery_planner(PlannerGlobal *glob, Query *parse,
  											 withCheckOptionLists,
  											 returningLists,
  											 rowMarks,
! 											 SS_assign_special_param(root));
  		}
  	}
  
--- 610,617 ----
  											 withCheckOptionLists,
  											 returningLists,
  											 rowMarks,
! 											 SS_assign_special_param(root),
! 											 parse->limitCount);
  		}
  	}
  
***************
*** 1054,1060 **** inheritance_planner(PlannerInfo *root)
  									 withCheckOptionLists,
  									 returningLists,
  									 rowMarks,
! 									 SS_assign_special_param(root));
  }
  
  /*--------------------
--- 1055,1062 ----
  									 withCheckOptionLists,
  									 returningLists,
  									 rowMarks,
! 									 SS_assign_special_param(root),
! 									 parse->limitCount);
  }
  
  /*--------------------
***************
*** 2473,2478 **** limit_needed(Query *parse)
--- 2475,2485 ----
  {
  	Node	   *node;
  
+ 	/* ModifyTable handles the LIMIT for us if this is an UPDATE or a DELETE */
+ 	if (parse->commandType == CMD_UPDATE ||
+ 		parse->commandType == CMD_DELETE)
+ 		return false;
+ 
  	node = parse->limitCount;
  	if (node)
  	{
*** a/src/backend/parser/analyze.c
--- b/src/backend/parser/analyze.c
***************
*** 386,391 **** transformDeleteStmt(ParseState *pstate, DeleteStmt *stmt)
--- 386,394 ----
  	qual = transformWhereClause(pstate, stmt->whereClause,
  								EXPR_KIND_WHERE, "WHERE");
  
+ 	qry->limitCount = transformLimitClause(pstate, stmt->limitClause,
+ 										   EXPR_KIND_LIMIT, "LIMIT");
+ 
  	qry->returningList = transformReturningList(pstate, stmt->returningList);
  
  	/* done building the range table and jointree */
***************
*** 1947,1952 **** transformUpdateStmt(ParseState *pstate, UpdateStmt *stmt)
--- 1950,1958 ----
  	qual = transformWhereClause(pstate, stmt->whereClause,
  								EXPR_KIND_WHERE, "WHERE");
  
+ 	qry->limitCount = transformLimitClause(pstate, stmt->limitClause,
+ 										   EXPR_KIND_LIMIT, "LIMIT");
+ 
  	qry->returningList = transformReturningList(pstate, stmt->returningList);
  
  	qry->rtable = pstate->p_rtable;
*** a/src/backend/parser/gram.y
--- b/src/backend/parser/gram.y
***************
*** 9163,9175 **** returning_clause:
   *****************************************************************************/
  
  DeleteStmt: opt_with_clause DELETE_P FROM relation_expr_opt_alias
! 			using_clause where_or_current_clause returning_clause
  				{
  					DeleteStmt *n = makeNode(DeleteStmt);
  					n->relation = $4;
  					n->usingClause = $5;
  					n->whereClause = $6;
! 					n->returningList = $7;
  					n->withClause = $1;
  					$$ = (Node *)n;
  				}
--- 9163,9181 ----
   *****************************************************************************/
  
  DeleteStmt: opt_with_clause DELETE_P FROM relation_expr_opt_alias
! 			using_clause where_or_current_clause opt_select_limit returning_clause
  				{
  					DeleteStmt *n = makeNode(DeleteStmt);
  					n->relation = $4;
  					n->usingClause = $5;
  					n->whereClause = $6;
! 					if (linitial($7) != NULL)
! 						ereport(ERROR,
! 								(errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
! 								 errmsg("OFFSET is not supported with DELETE"),
! 								 parser_errposition(@7)));
! 					n->limitClause = lsecond($7);
! 					n->returningList = $8;
  					n->withClause = $1;
  					$$ = (Node *)n;
  				}
***************
*** 9229,9234 **** UpdateStmt: opt_with_clause UPDATE relation_expr_opt_alias
--- 9235,9241 ----
  			SET set_clause_list
  			from_clause
  			where_or_current_clause
+ 			opt_select_limit
  			returning_clause
  				{
  					UpdateStmt *n = makeNode(UpdateStmt);
***************
*** 9236,9242 **** UpdateStmt: opt_with_clause UPDATE relation_expr_opt_alias
  					n->targetList = $5;
  					n->fromClause = $6;
  					n->whereClause = $7;
! 					n->returningList = $8;
  					n->withClause = $1;
  					$$ = (Node *)n;
  				}
--- 9243,9255 ----
  					n->targetList = $5;
  					n->fromClause = $6;
  					n->whereClause = $7;
! 					if (linitial($8) != NULL)
! 						ereport(ERROR,
! 								(errcode(ERRCODE_FEATURE_NOT_SUPPORTED),
! 								 errmsg("OFFSET is not supported with UPDATE"),
! 								 parser_errposition(@8)));
! 					n->limitClause = lsecond($8);
! 					n->returningList = $9;
  					n->withClause = $1;
  					$$ = (Node *)n;
  				}
*** a/src/include/nodes/execnodes.h
--- b/src/include/nodes/execnodes.h
***************
*** 1088,1094 **** typedef struct ModifyTableState
  	ResultRelInfo *resultRelInfo;		/* per-subplan target relations */
  	List	  **mt_arowmarks;	/* per-subplan ExecAuxRowMark lists */
  	EPQState	mt_epqstate;	/* for evaluating EvalPlanQual rechecks */
! 	bool		fireBSTriggers; /* do we need to fire stmt triggers? */
  } ModifyTableState;
  
  /* ----------------
--- 1088,1097 ----
  	ResultRelInfo *resultRelInfo;		/* per-subplan target relations */
  	List	  **mt_arowmarks;	/* per-subplan ExecAuxRowMark lists */
  	EPQState	mt_epqstate;	/* for evaluating EvalPlanQual rechecks */
! 	bool		firstCall;		/* is this our first time through? */
! 	ExprState   *limitCount;    /* LIMIT expression, if any */
! 	int64_t     maxProcessed;   /* maximum number of rows we're allowed to process */
! 	int64_t     processed;      /* number of rows we've processed so far */
  } ModifyTableState;
  
  /* ----------------
*** a/src/include/nodes/parsenodes.h
--- b/src/include/nodes/parsenodes.h
***************
*** 1060,1065 **** typedef struct DeleteStmt
--- 1060,1066 ----
  	Node	   *whereClause;	/* qualifications */
  	List	   *returningList;	/* list of expressions to return */
  	WithClause *withClause;		/* WITH clause */
+ 	Node	   *limitClause;	/* LIMIT clause */
  } DeleteStmt;
  
  /* ----------------------
***************
*** 1075,1080 **** typedef struct UpdateStmt
--- 1076,1082 ----
  	List	   *fromClause;		/* optional from clause for more tables */
  	List	   *returningList;	/* list of expressions to return */
  	WithClause *withClause;		/* WITH clause */
+ 	Node	   *limitClause;	/* LIMIT clause */
  } UpdateStmt;
  
  /* ----------------------
*** a/src/include/nodes/plannodes.h
--- b/src/include/nodes/plannodes.h
***************
*** 177,182 **** typedef struct ModifyTable
--- 177,183 ----
  	List	   *fdwPrivLists;	/* per-target-table FDW private data lists */
  	List	   *rowMarks;		/* PlanRowMarks (non-locking only) */
  	int			epqParam;		/* ID of Param for EvalPlanQual re-eval */
+ 	Node	   *limitCount;		/* maximum number of rows to process */
  } ModifyTable;
  
  /* ----------------
*** a/src/include/optimizer/planmain.h
--- b/src/include/optimizer/planmain.h
***************
*** 84,90 **** extern ModifyTable *make_modifytable(PlannerInfo *root,
  				 CmdType operation, bool canSetTag,
  				 List *resultRelations, List *subplans,
  				 List *withCheckOptionLists, List *returningLists,
! 				 List *rowMarks, int epqParam);
  extern bool is_projection_capable_plan(Plan *plan);
  
  /*
--- 84,90 ----
  				 CmdType operation, bool canSetTag,
  				 List *resultRelations, List *subplans,
  				 List *withCheckOptionLists, List *returningLists,
! 				 List *rowMarks, int epqParam, Node *limitCount);
  extern bool is_projection_capable_plan(Plan *plan);
  
  /*
*** a/src/test/regress/expected/delete.out
--- b/src/test/regress/expected/delete.out
***************
*** 31,33 **** SELECT id, a, char_length(b) FROM delete_test;
--- 31,104 ----
  (1 row)
  
  DROP TABLE delete_test;
+ -- LIMIT
+ CREATE TABLE delete_test AS
+ SELECT i*10 AS a FROM generate_series(1, 10) i;
+ DELETE FROM delete_test LIMIT -1;
+ ERROR:  LIMIT must not be negative
+ DELETE FROM delete_test OFFSET 0;
+ ERROR:  OFFSET is not supported with DELETE
+ LINE 1: DELETE FROM delete_test OFFSET 0;
+                                 ^
+ DELETE FROM delete_test LIMIT 0;
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+     10
+ (1 row)
+ 
+ DELETE FROM delete_test LIMIT 1;
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+      9
+ (1 row)
+ 
+ DELETE FROM delete_test LIMIT (SELECT 1);
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+      8
+ (1 row)
+ 
+ -- test against partitioned table
+ CREATE TABLE delete_test_child(a int) INHERITS (delete_test);
+ NOTICE:  merging column "a" with inherited definition
+ INSERT INTO delete_test_child VALUES (5), (15), (25);
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+     11
+ (1 row)
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+      6
+ (1 row)
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+      1
+ (1 row)
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+  count 
+ -------
+      0
+ (1 row)
+ 
+ EXPLAIN (COSTS OFF) DELETE FROM delete_test LIMIT 1;
+              QUERY PLAN              
+ -------------------------------------
+  Delete on delete_test
+    ->  Seq Scan on delete_test
+    ->  Seq Scan on delete_test_child
+ (3 rows)
+ 
+ DROP TABLE delete_test CASCADE;
+ NOTICE:  drop cascades to table delete_test_child
*** a/src/test/regress/expected/update.out
--- b/src/test/regress/expected/update.out
***************
*** 148,150 **** SELECT a, b, char_length(c) FROM update_test;
--- 148,221 ----
  (4 rows)
  
  DROP TABLE update_test;
+ -- LIMIT
+ CREATE TABLE update_test AS
+ SELECT i*10 AS a FROM generate_series(1, 10) i;
+ UPDATE update_test SET a = 0 LIMIT -1;
+ ERROR:  LIMIT must not be negative
+ UPDATE update_test SET a = 0 OFFSET 0;
+ ERROR:  OFFSET is not supported with UPDATE
+ LINE 1: UPDATE update_test SET a = 0 OFFSET 0;
+                                      ^
+ UPDATE update_test SET a = -1 LIMIT 0;
+ SELECT a FROM update_test WHERE a = -1;
+  a 
+ ---
+ (0 rows)
+ 
+ UPDATE update_test SET a = -1 LIMIT 1;
+ SELECT a FROM update_test WHERE a = -1;
+  a  
+ ----
+  -1
+ (1 row)
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT (SELECT 1);
+ SELECT a FROM update_test WHERE a = -1;
+  a  
+ ----
+  -1
+  -1
+ (2 rows)
+ 
+ -- test against partitioned table
+ CREATE TABLE update_test_child(a int) INHERITS (update_test);
+ NOTICE:  merging column "a" with inherited definition
+ INSERT INTO update_test_child VALUES (5), (15), (25);
+ SELECT count(*) FROM update_test WHERE a <> -1;
+  count 
+ -------
+     11
+ (1 row)
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+  count 
+ -------
+      6
+ (1 row)
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+  count 
+ -------
+      1
+ (1 row)
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+  count 
+ -------
+      0
+ (1 row)
+ 
+ EXPLAIN (COSTS OFF) UPDATE update_test SET a = 10 LIMIT 1;
+              QUERY PLAN              
+ -------------------------------------
+  Update on update_test
+    ->  Seq Scan on update_test
+    ->  Seq Scan on update_test_child
+ (3 rows)
+ 
+ DROP TABLE update_test CASCADE;
+ NOTICE:  drop cascades to table update_test_child
*** a/src/test/regress/sql/delete.sql
--- b/src/test/regress/sql/delete.sql
***************
*** 23,25 **** DELETE FROM delete_test WHERE a > 25;
--- 23,61 ----
  SELECT id, a, char_length(b) FROM delete_test;
  
  DROP TABLE delete_test;
+ 
+ -- LIMIT
+ 
+ CREATE TABLE delete_test AS
+ SELECT i*10 AS a FROM generate_series(1, 10) i;
+ 
+ DELETE FROM delete_test LIMIT -1;
+ 
+ DELETE FROM delete_test OFFSET 0;
+ 
+ DELETE FROM delete_test LIMIT 0;
+ SELECT count(*) FROM delete_test;
+ 
+ DELETE FROM delete_test LIMIT 1;
+ SELECT count(*) FROM delete_test;
+ 
+ DELETE FROM delete_test LIMIT (SELECT 1);
+ SELECT count(*) FROM delete_test;
+ 
+ -- test against partitioned table
+ CREATE TABLE delete_test_child(a int) INHERITS (delete_test);
+ INSERT INTO delete_test_child VALUES (5), (15), (25);
+ SELECT count(*) FROM delete_test;
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+ 
+ DELETE FROM delete_test LIMIT 5;
+ SELECT count(*) FROM delete_test;
+ 
+ EXPLAIN (COSTS OFF) DELETE FROM delete_test LIMIT 1;
+ 
+ DROP TABLE delete_test CASCADE;
*** a/src/test/regress/sql/update.sql
--- b/src/test/regress/sql/update.sql
***************
*** 75,77 **** UPDATE update_test SET c = repeat('x', 10000) WHERE c = 'car';
--- 75,113 ----
  SELECT a, b, char_length(c) FROM update_test;
  
  DROP TABLE update_test;
+ 
+ -- LIMIT
+ 
+ CREATE TABLE update_test AS
+ SELECT i*10 AS a FROM generate_series(1, 10) i;
+ 
+ UPDATE update_test SET a = 0 LIMIT -1;
+ 
+ UPDATE update_test SET a = 0 OFFSET 0;
+ 
+ UPDATE update_test SET a = -1 LIMIT 0;
+ SELECT a FROM update_test WHERE a = -1;
+ 
+ UPDATE update_test SET a = -1 LIMIT 1;
+ SELECT a FROM update_test WHERE a = -1;
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT (SELECT 1);
+ SELECT a FROM update_test WHERE a = -1;
+ 
+ -- test against partitioned table
+ CREATE TABLE update_test_child(a int) INHERITS (update_test);
+ INSERT INTO update_test_child VALUES (5), (15), (25);
+ SELECT count(*) FROM update_test WHERE a <> -1;
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+ 
+ UPDATE update_test SET a = -1 WHERE a <> -1 LIMIT 5;
+ SELECT count(*) FROM update_test WHERE a <> -1;
+ 
+ EXPLAIN (COSTS OFF) UPDATE update_test SET a = 10 LIMIT 1;
+ 
+ DROP TABLE update_test CASCADE;

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Rukh Meski (#1)

Re: LIMIT for UPDATE and DELETE

Hi Rukh,

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

Before looking into the patch, I'd like to know the use cases in more
details.

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Amit Kapila

amit.kapila16@gmail.com

over 11 years ago

In reply to: Etsuro Fujita (#2)

Re: LIMIT for UPDATE and DELETE

On Mon, Aug 25, 2014 at 12:18 PM, Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp>
wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

Before looking into the patch, I'd like to know the use cases in more

details.

You can once check the previous commit fest thread [1]https://commitfest.postgresql.org/action/patch_view?id=1412 for
this feature, in that you can find some use cases and what are
the difficulties to implement this feature, it might aid you in
review of this feature.

[1]: https://commitfest.postgresql.org/action/patch_view?id=1412
https://commitfest.postgresql.org/action/patch_view?id=1412

With Regards,
Amit Kapila.
EnterpriseDB: http://www.enterprisedb.com

Kevin Grittner

kgrittn@ymail.com

over 11 years ago

In reply to: Etsuro Fujita (#2)

Re: LIMIT for UPDATE and DELETE

Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp> wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

Before looking into the patch, I'd like to know the use cases in more
details.

There have been a few times I wanted something like this, so that
it was easier to do an update that affects a very high percentage
of rows in a table, while making the old version of the row no
longer match the selection criteria for the UPDATE. There are
workarounds using cursors or subselects returning ctid, but they
are kludgy and error prone. Basically I wanted to alternate UPDATE
of a subset of the rows with table VACUUM so that subsequent
iterations can re-use space and avoid bloating the table.

--
Kevin Grittner
EDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Jeff Janes

jeff.janes@gmail.com

over 11 years ago

In reply to: Etsuro Fujita (#2)

Re: LIMIT for UPDATE and DELETE

On Sun, Aug 24, 2014 at 11:48 PM, Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp

wrote:

Hi Rukh,

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

Before looking into the patch, I'd like to know the use cases in more
details.

There are two common use cases I can think of:

I've just added a column to an existing table, and it is all NULL. I've
changed the code to populate that column appropriately for new or updated
rows, but I need to back fill the existing rows. I have a (slow) method to
compute the new value. (I've not yet changed the code to depend on that
column being populated)

The obvious solution is:

update the_table set new_col=populate_new_col(whatever) where new_col is
null.

But this will bloat the table because vacuum cannot intervene, and will
take a very long time. The first row to be update will remain locked until
the last row gets updated, which is not acceptable. And if something goes
wrong before the commit, you've lost all the work.

With the limit clause, you can just do this:

update the_table set new_col=populate_new_col(whatever) where new_col is
null limit 50000;

In a loop with appropriate vacuuming and throttling.

I've introduced or re-designed partitioning, and need to migrate rows to
the appropriate partitions without long lived row locks.

create table pgbench_accounts2 () inherits (pgbench_accounts);

and then in a loop:

with t as (delete from only pgbench_accounts where aid < 500000 limit 5000
returning *)
insert into pgbench_accounts2 select * from t;

Cheers,

Jeff

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Etsuro Fujita (#2)

Re: LIMIT for UPDATE and DELETE

(2014/08/25 15:48), Etsuro Fujita wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

Before looking into the patch, I'd like to know the use cases in more
details.

Thanks for the input, Amit, Kevin and Jeff! I understand that the patch
is useful.

I've looked at the patch a bit closely. Here is my initial thought
about the patch.

The patch places limit-counting inside ModifyTable, and works well for
inheritance trees, but I'm not sure that that is the right way to go. I
think that this feature should be implemented in the way that we can
naturally extend it to the ORDER-BY-LIMIT case in future. But honestly
the patch doesn't seem to take into account that, I might be missing
something, though. What plan do you have for the future extensibility?

I think that the approach discussed in [1]/messages/by-id/26819.1291133045@sss.pgh.pa.us would be promissing, so ISTM
that it would be better to implement this feature by allowing for plans
in the form of eg, ModifyTModifyTable+Limit+Append.

Thanks,

[1]: /messages/by-id/26819.1291133045@sss.pgh.pa.us

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Etsuro Fujita (#6)

Re: LIMIT for UPDATE and DELETE

On 8/29/14 12:20 PM, Etsuro Fujita wrote:

The patch places limit-counting inside ModifyTable, and works well for
inheritance trees, but I'm not sure that that is the right way to go. I
think that this feature should be implemented in the way that we can
naturally extend it to the ORDER-BY-LIMIT case in future. But honestly
the patch doesn't seem to take into account that, I might be missing
something, though.

The LIMIT part *has* to happen after the rows have been locked or it
will work very surprisingly under concurrency (sort of like how FOR
SHARE / FOR UPDATE worked before 9.0). So either it has to be inside
ModifyTable or the ModifyTable has to somehow pass something to a Limit
node on top of it which would then realize that the tuples from
ModifyTable aren't supposed to be sent to the client (unless there's a
RETURNING clause). I think it's a lot nicer to do the LIMITing inside
ModifyTable, even though that duplicates a small portion of code that
already exists in the Limit node.

What plan do you have for the future extensibility?

I think that the approach discussed in [1] would be promissing, so ISTM
that it would be better to implement this feature by allowing for plans
in the form of eg, ModifyTModifyTable+Limit+Append.

I don't see an approach discussed there, just a listing of problems with
no solutions.

This is just my personal opinion, but what I think should happen is:

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to
2) We allow ORDER BY on tables with no inheritance children using
something similar to Rukh's previous patch.
3) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,
which allows us to support ORDER BY on all tables (or perhaps maybe not
FDWs, I don't know how those work). The LIMIT functionality in this
patch is unaffected.

Now, I know some people disagree with me on whether step #2 is worth
taking or not, but that's a separate discussion. My point w.r.t. this
patch still stands: I don't see any forwards compatibility problems with
this approach, nor do I really see any viable alternatives either.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Marko Tiikkaja (#7)

Re: LIMIT for UPDATE and DELETE

On 8/29/14 1:53 PM, I wrote:

This is just my personal opinion, but what I think should happen is:

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to

Oops, looks like I didn't finish my thought here.

.. but helps numerous people who today have to achieve the same thing
via tedious, slow and problematic subqueries (or a choose-any-two
combination of these).

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Tom Lane

tgl@sss.pgh.pa.us

over 11 years ago

In reply to: Marko Tiikkaja (#7)

Re: LIMIT for UPDATE and DELETE

Marko Tiikkaja <marko@joh.to> writes:

The LIMIT part *has* to happen after the rows have been locked or it
will work very surprisingly under concurrency (sort of like how FOR
SHARE / FOR UPDATE worked before 9.0).

Good point.

So either it has to be inside
ModifyTable or the ModifyTable has to somehow pass something to a Limit
node on top of it

... or we add a LockRows node below the Limit node. Yeah, that would make
UPDATE/LIMIT a tad slower, but I think that might be preferable to what
you're proposing anyway. Raw speed of what is fundamentally a fringe
feature ought not trump every other concern.

This is just my personal opinion, but what I think should happen is:

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to
2) We allow ORDER BY on tables with no inheritance children using
something similar to Rukh's previous patch.
3) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,
which allows us to support ORDER BY on all tables (or perhaps maybe not
FDWs, I don't know how those work). The LIMIT functionality in this
patch is unaffected.

I still think we should skip #2 and go directly to work on #3. Getting
rid of the unholy mess that is inheritance_planner would be a very nice
thing.

regards, tom lane

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#10

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Tom Lane (#9)

Re: LIMIT for UPDATE and DELETE

On 8/29/14 4:33 PM, Tom Lane wrote:

So either it has to be inside
ModifyTable or the ModifyTable has to somehow pass something to a Limit
node on top of it

... or we add a LockRows node below the Limit node. Yeah, that would make
UPDATE/LIMIT a tad slower, but I think that might be preferable to what
you're proposing anyway. Raw speed of what is fundamentally a fringe
feature ought not trump every other concern.

I don't consider this a fringe feature, but in any case, the main use
case for LIMIT without ORDER BY in UPDATE and DELETE is to split up
large transactions into smaller batches. And considering that, I think
raw speed should be a concern (though it shouldn't trump every other
concern, obviously).

More to the point, personally, I think the changes to nodeModifyTable.c
are very reasonable so it's not clear to me that the "extra
LockRows+Limit nodes" approach would be inherently better (even ignoring
performance concerns).

This is just my personal opinion, but what I think should happen is:

2) We allow ORDER BY on tables with no inheritance children using
something similar to Rukh's previous patch.
3) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,

I still think we should skip #2 and go directly to work on #3. Getting
rid of the unholy mess that is inheritance_planner would be a very nice
thing.

Ideally? Yeah, that would be great. But I don't see anyone
volunteering to do that work, and I think holding back a useful feature
(ORDER BY with UPDATE/DELETE) in hopes of getting someone to volunteer
to do it is insane. Now, you're free to argue that ORDER BY with
UPDATE/DELETE isn't that useful, of course, but I'm sure there are lots
of people who agree with me.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#11

Robert Haas

robertmhaas@gmail.com

over 11 years ago

In reply to: Marko Tiikkaja (#10)

Re: LIMIT for UPDATE and DELETE

On Mon, Sep 1, 2014 at 8:06 AM, Marko Tiikkaja <marko@joh.to> wrote:

Ideally? Yeah, that would be great. But I don't see anyone volunteering to
do that work, and I think holding back a useful feature (ORDER BY with
UPDATE/DELETE) in hopes of getting someone to volunteer to do it is insane.
Now, you're free to argue that ORDER BY with UPDATE/DELETE isn't that
useful, of course, but I'm sure there are lots of people who agree with me.

I still agree with Tom. Arbitrary restrictions on which features can
be used in combination with each other piss off and alienate users.
We've put quite a bit of effort into making table inheritance not suck
(e.g. statistics on inheritance trees, Merge Append, etc.). Making it
suck more because you don't think it's as important as your feature
is, in my opinion, not cool.

This is not to say that I don't like the feature. I like it a lot.
But I like a product where you can be sure that if walking works and
chewing gum works you can also walk and chew gum at the same time even
more.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#12

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Robert Haas (#11)

Re: LIMIT for UPDATE and DELETE

On 9/3/14 4:46 PM, Robert Haas wrote:

Making it
suck more because you don't think it's as important as your feature
is, in my opinion, not cool.

I really can't see how that would make inheritance suck *more*. You
can't do UPDATE .. ORDER BY now, and you wouldn't be able to do it after
the patch. Yeah, sure, perhaps people using inheritance might feel left
out, but surely that would just motivate them to work on improving it.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#13

Robert Haas

robertmhaas@gmail.com

over 11 years ago

In reply to: Marko Tiikkaja (#12)

Re: LIMIT for UPDATE and DELETE

On Wed, Sep 3, 2014 at 11:02 AM, Marko Tiikkaja <marko@joh.to> wrote:

On 9/3/14 4:46 PM, Robert Haas wrote:

Making it
suck more because you don't think it's as important as your feature
is, in my opinion, not cool.

I really can't see how that would make inheritance suck *more*. You can't
do UPDATE .. ORDER BY now, and you wouldn't be able to do it after the
patch. Yeah, sure, perhaps people using inheritance might feel left out,
but surely that would just motivate them to work on improving it.

I think it's entirely reasonable for us to require that all new SQL
features should be required to work with or without inheritance. If
we took the opposition position, and said that the only things that
need to work with inheritance are the ones that existed at the time
inheritance was introduced, then we'd not need to worry about it not
only for this feature but for row-level security and SKIP LOCKED and
GROUPING SETS and, going back a bit further, materialized views and
security-barrier views and LATERAL and CTEs and on and on. Perhaps
not all of those require any special handling for inheritance
hierarchies, but some of them surely did, and if even 10% of the SQL
features that we have added since table inheritance were allowed to
opt out of supporting it, we'd have a broken and unusable feature
today.

Now some people might argue that we have that anyway, but the fact is
that a lot of people are using inheritance today, even with all its
flaws, and they wouldn't be if there were a long laundry list of
limitations that didn't apply to regular tables. We should be looking
to lift the limitations that currently exist, not add more.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#14

Heikki Linnakangas

hlinnakangas@vmware.com

over 11 years ago

In reply to: Robert Haas (#13)

Re: LIMIT for UPDATE and DELETE

On 09/03/2014 06:39 PM, Robert Haas wrote:

On Wed, Sep 3, 2014 at 11:02 AM, Marko Tiikkaja <marko@joh.to> wrote:

On 9/3/14 4:46 PM, Robert Haas wrote:

Making it
suck more because you don't think it's as important as your feature
is, in my opinion, not cool.

I really can't see how that would make inheritance suck *more*. You can't
do UPDATE .. ORDER BY now, and you wouldn't be able to do it after the
patch. Yeah, sure, perhaps people using inheritance might feel left out,
but surely that would just motivate them to work on improving it.

I think it's entirely reasonable for us to require that all new SQL
features should be required to work with or without inheritance. If
we took the opposition position, and said that the only things that
need to work with inheritance are the ones that existed at the time
inheritance was introduced, then we'd not need to worry about it not
only for this feature but for row-level security and SKIP LOCKED and
GROUPING SETS and, going back a bit further, materialized views and
security-barrier views and LATERAL and CTEs and on and on. Perhaps
not all of those require any special handling for inheritance
hierarchies, but some of them surely did, and if even 10% of the SQL
features that we have added since table inheritance were allowed to
opt out of supporting it, we'd have a broken and unusable feature
today.

Now some people might argue that we have that anyway, but the fact is
that a lot of people are using inheritance today, even with all its
flaws, and they wouldn't be if there were a long laundry list of
limitations that didn't apply to regular tables. We should be looking
to lift the limitations that currently exist, not add more.

I agree. If we are to support UPDATE .. ORDER BY .. LIMIT, it should
work with inheritance. So the path forward is (using Marko's phrasing
upthread):

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to
2) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,
which allows us to support ORDER BY on all tables (or perhaps maybe not
FDWs, I don't know how those work). The LIMIT functionality in this
patch is unaffected.

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough? And if we
apply this patch now, how much of it needs to be rewritten after 2) ? If
the answers are "yes" and "not much", then we should review this patch
now, and put 2) on the TODO list. Otherwise 2) should do done first.

Etsuro, Kaigei, please take a look at the patch and try to make a guess
on how much of this still needs to be rewritten if we do 2). If not
much, then please continue to review it, with the aim of getting it into
a committable state.

- Heikki

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#15

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Heikki Linnakangas (#14)

Re: LIMIT for UPDATE and DELETE

On 9/9/14 11:57 AM, Heikki Linnakangas wrote:

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough?

I'd say so; I could use it right now. I have to remove millions of
lines from a table, but I don't want the live transaction processing to
take a hit, so I have to do it in batches. Granted, some kind of rate
limiting would achieve the same thing for DELETE, but with UPDATE you
also have to consider row locking, and rate limiting wouldn't help with
that at all; it would, in fact, just make it worse. I'll also be
running a big UPDATE like that later today, so that's two uses today
alone for me. And no, these are not routine things so keep your "use
partitions" suggestions to yourselves.

And if we
apply this patch now, how much of it needs to be rewritten after 2) ? If
the answers are "yes" and "not much", then we should review this patch
now, and put 2) on the TODO list. Otherwise 2) should do done first.

I'd say "not much, if anything at all".

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#16

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Marko Tiikkaja (#15)

Re: LIMIT for UPDATE and DELETE

On 9/9/14 12:37 PM, I wrote:

And no, these are not routine things so keep your "use
partitions" suggestions to yourselves.

My apologies. This was not directed at you personally, Heikki, and in
any case it was unnecessarily hostile.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#17

Heikki Linnakangas

hlinnakangas@vmware.com

over 11 years ago

In reply to: Marko Tiikkaja (#16)

Re: LIMIT for UPDATE and DELETE

On 09/09/2014 01:46 PM, Marko Tiikkaja wrote:

On 9/9/14 12:37 PM, I wrote:

And no, these are not routine things so keep your "use
partitions" suggestions to yourselves.

My apologies. This was not directed at you personally, Heikki, and in
any case it was unnecessarily hostile.

No worries, it made me smile :-)

- Heikki

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#18

Jeff Janes

jeff.janes@gmail.com

over 11 years ago

In reply to: Heikki Linnakangas (#14)

Re: LIMIT for UPDATE and DELETE

On Tue, Sep 9, 2014 at 2:57 AM, Heikki Linnakangas <hlinnakangas@vmware.com>
wrote:

I agree. If we are to support UPDATE .. ORDER BY .. LIMIT, it should work
with inheritance. So the path forward is (using Marko's phrasing upthread):

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to
2) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,
which allows us to support ORDER BY on all tables (or perhaps maybe not
FDWs, I don't know how those work). The LIMIT functionality in this
patch is unaffected.

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough?

I've wanted LIMIT even without ORDER BY many times, so I'd vote yes.

And if we apply this patch now, how much of it needs to be rewritten after
2) ? If the answers are "yes" and "not much", then we should review this
patch now, and put 2) on the TODO list. Otherwise 2) should do done first.

On that I can't give any useful feedback.

Cheers,

Jeff

#19

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Heikki Linnakangas (#14)

Re: LIMIT for UPDATE and DELETE

(2014/09/09 18:57), Heikki Linnakangas wrote:

On 09/03/2014 06:39 PM, Robert Haas wrote:

Now some people might argue that we have that anyway, but the fact is
that a lot of people are using inheritance today, even with all its
flaws, and they wouldn't be if there were a long laundry list of
limitations that didn't apply to regular tables. We should be looking
to lift the limitations that currently exist, not add more.

I agree. If we are to support UPDATE .. ORDER BY .. LIMIT, it should
work with inheritance. So the path forward is (using Marko's phrasing
upthread):

1) We put the LIMIT inside ModifyTable like this patch does. This
doesn't prevent us from doing ORDER BY in the future, but helps numerous
people who today have to
2) Someone rewrites how UPDATE works based on Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us,
which allows us to support ORDER BY on all tables (or perhaps maybe not
FDWs, I don't know how those work). The LIMIT functionality in this
patch is unaffected.

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough? And if we
apply this patch now, how much of it needs to be rewritten after 2) ? If
the answers are "yes" and "not much", then we should review this patch
now, and put 2) on the TODO list. Otherwise 2) should do done first.

My answers are "yes" but "completely rewritten". So, I think we should
work on 2) first ideally, but 2) seems a large project at least to me.
So, I think it would be reasonable to implement UPDATE/DELETE .. LIMIT
based on Rukh' patch for now and put 2) and the re-implementation of 1)
after 2) on the TODO list.

I don't have enough time to review it for a while, so I'd like to work
on it (and postgres_fdw's "update pushdown" enhancement related to it)
at the next CF (I think I can do it earlier). I must apologize to Rukh
for not having enough time for the patch review.

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#20

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Etsuro Fujita (#19)

Re: LIMIT for UPDATE and DELETE

On 2014-09-10 04:25, Etsuro Fujita wrote:

(2014/09/09 18:57), Heikki Linnakangas wrote:

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough? And if we
apply this patch now, how much of it needs to be rewritten after 2) ? If
the answers are "yes" and "not much", then we should review this patch
now, and put 2) on the TODO list. Otherwise 2) should do done first.

My answers are "yes" but "completely rewritten".

Any particular reason for you to say that? Because an UPDATE might have
a RETURNING clause, all the updated tuples have to go through the
ModifyTable node one at a time. I don't see why we couldn't LIMIT there
after implementing #2.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#21

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Marko Tiikkaja (#20)

Re: LIMIT for UPDATE and DELETE

(2014/09/10 16:57), Marko Tiikkaja wrote:

On 2014-09-10 04:25, Etsuro Fujita wrote:

(2014/09/09 18:57), Heikki Linnakangas wrote:

What's not clear to me is whether it make sense to do 1) without 2) ? Is
UPDATE .. LIMIT without support for an ORDER BY useful enough? And if we
apply this patch now, how much of it needs to be rewritten after 2) ? If
the answers are "yes" and "not much", then we should review this patch
now, and put 2) on the TODO list. Otherwise 2) should do done first.

My answers are "yes" but "completely rewritten".

Any particular reason for you to say that? Because an UPDATE might have
a RETURNING clause, all the updated tuples have to go through the
ModifyTable node one at a time. I don't see why we couldn't LIMIT there
after implementing #2.

The reason is because I think that after implementing #2, we should
re-implement this feature by extending the planner to produce a plan
tree such as ModifyTable+Limit+Append, maybe with LockRows below the
Limit node. Such an approach would prevent duplication of the LIMIT
code in ModifyTable, making the ModifyTable code more simple, I think.

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#22

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Etsuro Fujita (#21)

Re: LIMIT for UPDATE and DELETE

On 9/10/14 11:25 AM, Etsuro Fujita wrote:

The reason is because I think that after implementing #2, we should
re-implement this feature by extending the planner to produce a plan
tree such as ModifyTable+Limit+Append, maybe with LockRows below the
Limit node. Such an approach would prevent duplication of the LIMIT
code in ModifyTable, making the ModifyTable code more simple, I think.

You can already change *this patch* to do ModifyTable <- Limit <-
LockRows. If we think that's what we want, we should rewrite this patch
right now. This isn't a reason not to implement LIMIT without ORDER BY.

Like I said upthread, I think LockRows is a bad idea, but I'll need to
run some performance tests first. But whichever method we decide to
implement for this patch shouldn't need to be touched when the changes
to UPDATE land, so your reasoning is incorrect.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#23

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Marko Tiikkaja (#22)

Re: LIMIT for UPDATE and DELETE

(2014/09/10 18:33), Marko Tiikkaja wrote:

On 9/10/14 11:25 AM, Etsuro Fujita wrote:

The reason is because I think that after implementing #2, we should
re-implement this feature by extending the planner to produce a plan
tree such as ModifyTable+Limit+Append, maybe with LockRows below the
Limit node. Such an approach would prevent duplication of the LIMIT
code in ModifyTable, making the ModifyTable code more simple, I think.

You can already change *this patch* to do ModifyTable <- Limit <-
LockRows. If we think that's what we want, we should rewrite this patch
right now.

I think it might be relatively easy to do that for single-table cases,
but for inheritance cases, inheritance_planner needs work and I'm not
sure how much work it would take ...

Like I said upthread, I think LockRows is a bad idea, but I'll need to
run some performance tests first. But whichever method we decide to
implement for this patch shouldn't need to be touched when the changes
to UPDATE land, so your reasoning is incorrect.

Yeah, as you say, we need the performance tests, and I think it would
depend on those results whether LockRows is a bad idea or not.

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#24

Marko Tiikkaja

marko@joh.to

over 11 years ago

In reply to: Etsuro Fujita (#23)

Re: LIMIT for UPDATE and DELETE

On 9/10/14 12:05 PM, Etsuro Fujita wrote:

(2014/09/10 18:33), Marko Tiikkaja wrote:

You can already change *this patch* to do ModifyTable <- Limit <-
LockRows. If we think that's what we want, we should rewrite this patch
right now.

I think it might be relatively easy to do that for single-table cases,
but for inheritance cases, inheritance_planner needs work and I'm not
sure how much work it would take ...

Uh. Yeah, I think I'm an idiot and you're right.

I'll try and get some benchmarking done and see what comes out.

.marko

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#25

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Rukh Meski (#1)

Re: LIMIT for UPDATE and DELETE

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

IIUC, the patch doesn't support OFFSET with UPDATE/DELETE ... LIMIT. Is
that OK? When we support ORDER BY ... LIMIT/OFFSET, we will also be
allowing for OFFSET with UPDATE/DELETE ... LIMIT. So, ISTM it would be
better for the patch to support OFFSET at this point. No?

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#26

Kevin Grittner

kgrittn@ymail.com

over 11 years ago

In reply to: Etsuro Fujita (#25)

Re: LIMIT for UPDATE and DELETE

Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp> wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

IIUC, the patch doesn't support OFFSET with UPDATE/DELETE ... LIMIT. Is
that OK? When we support ORDER BY ... LIMIT/OFFSET, we will also be
allowing for OFFSET with UPDATE/DELETE ... LIMIT. So, ISTM it would be
better for the patch to support OFFSET at this point. No?

Without ORDER BY you really would have no idea *which* rows the
OFFSET would be skipping. Even more dangerously, you might *think*
you do, and get a surprise when you see the results (if, for
example, a seqscan starts at a point other than the start of the
heap, due to a concurrent seqscan from an unrelated query). It
might be better not to provide an illusion of a degree of control
you don't have, especially for UPDATE and DELETE operations.

--
Kevin Grittner
EDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#27

Robert Haas

robertmhaas@gmail.com

over 11 years ago

In reply to: Kevin Grittner (#26)

Re: LIMIT for UPDATE and DELETE

On Tue, Sep 16, 2014 at 11:31 AM, Kevin Grittner <kgrittn@ymail.com> wrote:

Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp> wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

IIUC, the patch doesn't support OFFSET with UPDATE/DELETE ... LIMIT. Is
that OK? When we support ORDER BY ... LIMIT/OFFSET, we will also be
allowing for OFFSET with UPDATE/DELETE ... LIMIT. So, ISTM it would be
better for the patch to support OFFSET at this point. No?

Without ORDER BY you really would have no idea *which* rows the
OFFSET would be skipping. Even more dangerously, you might *think*
you do, and get a surprise when you see the results (if, for
example, a seqscan starts at a point other than the start of the
heap, due to a concurrent seqscan from an unrelated query). It
might be better not to provide an illusion of a degree of control
you don't have, especially for UPDATE and DELETE operations.

Fair point, but I'd lean toward including it. I think we all agree
the end goal is ORDER BY .. LIMIT, and there OFFSET certainly has
meaning. If we don't include it now, we'll just end up adding it
later. It makes for fewer patches, and fewer changes for users, if we
do it all at once.

--
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#28

Etsuro Fujita

fujita.etsuro@lab.ntt.co.jp

over 11 years ago

In reply to: Robert Haas (#27)

Re: LIMIT for UPDATE and DELETE

(2014/09/17 1:58), Robert Haas wrote:

On Tue, Sep 16, 2014 at 11:31 AM, Kevin Grittner <kgrittn@ymail.com> wrote:

Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp> wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

IIUC, the patch doesn't support OFFSET with UPDATE/DELETE ... LIMIT. Is
that OK? When we support ORDER BY ... LIMIT/OFFSET, we will also be
allowing for OFFSET with UPDATE/DELETE ... LIMIT. So, ISTM it would be
better for the patch to support OFFSET at this point. No?

Without ORDER BY you really would have no idea *which* rows the
OFFSET would be skipping. Even more dangerously, you might *think*
you do, and get a surprise when you see the results (if, for
example, a seqscan starts at a point other than the start of the
heap, due to a concurrent seqscan from an unrelated query). It
might be better not to provide an illusion of a degree of control
you don't have, especially for UPDATE and DELETE operations.

Fair point, but I'd lean toward including it. I think we all agree
the end goal is ORDER BY .. LIMIT, and there OFFSET certainly has
meaning. If we don't include it now, we'll just end up adding it
later. It makes for fewer patches, and fewer changes for users, if we
do it all at once.

I agree with Robert.

Rukh, what do you think as an author?

Thanks,

Best regards,
Etsuro Fujita

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

#29

Heikki Linnakangas

hlinnakangas@vmware.com

over 11 years ago

In reply to: Etsuro Fujita (#28)

Re: LIMIT for UPDATE and DELETE

On 09/24/2014 05:22 AM, Etsuro Fujita wrote:

(2014/09/17 1:58), Robert Haas wrote:

On Tue, Sep 16, 2014 at 11:31 AM, Kevin Grittner <kgrittn@ymail.com> wrote:

Etsuro Fujita <fujita.etsuro@lab.ntt.co.jp> wrote:

(2014/08/15 6:18), Rukh Meski wrote:

Based on the feedback on my previous patch, I've separated only the
LIMIT part into its own feature. This version plays nicely with
inheritance. The intended use is splitting up big UPDATEs and DELETEs
into batches more easily and efficiently.

IIUC, the patch doesn't support OFFSET with UPDATE/DELETE ... LIMIT. Is
that OK? When we support ORDER BY ... LIMIT/OFFSET, we will also be
allowing for OFFSET with UPDATE/DELETE ... LIMIT. So, ISTM it would be
better for the patch to support OFFSET at this point. No?

Without ORDER BY you really would have no idea *which* rows the
OFFSET would be skipping. Even more dangerously, you might *think*
you do, and get a surprise when you see the results (if, for
example, a seqscan starts at a point other than the start of the
heap, due to a concurrent seqscan from an unrelated query). It
might be better not to provide an illusion of a degree of control
you don't have, especially for UPDATE and DELETE operations.

Fair point, but I'd lean toward including it. I think we all agree
the end goal is ORDER BY .. LIMIT, and there OFFSET certainly has
meaning. If we don't include it now, we'll just end up adding it
later. It makes for fewer patches, and fewer changes for users, if we
do it all at once.

I agree with Robert.

Rukh, what do you think as an author?

I have marked this as "returned with feedback" in the commitfest app.
What I'd like to see happen next is:

Rewrite how UPDATEs work, per Tom's suggestion here:
/messages/by-id/1598.1399826841@sss.pgh.pa.us. Then
implement ORDER BY ... LIMIT on top of that.

A lot of people would also be willing to settle for just implementing
LIMIT without ORDER BY, as a stopgap measure. But the UPDATE rewrite is
what would make everyone most happy.

- Heikki

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers